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figure 1 The PCR amplification cycle. 



primer sequences, but manual inspection of the oligonu- 
cleotide is still necessary to maximize successful PCK 
amplifications. 

The concentrations of the PCR cocktail ingredients are 
also important for product specificity, fidelity and yield Jn 
addition to Taq DNA polymerase and P"mers the PCK 
mixture contains the cofactor magnesium ion (Mg ), "le 
four 2'-deoxyribonucleoside-5'-triphosphates (dNlFs) 
and the buffer. In general, PCR reagent concentrations 
that are too high from 'standard conditions result in 
nonspecific products with high misincorporation errors, 
and those that are too low result in insufficient product. A 
typical 50-til PCR cocktail that contains 0.4 jmiol L o 
e^h primer, 200nmol L" ' of each dNTP. l.Smmol L 
MgCh and 1.25 units Taq DNA polymerase m 



10 mmol L - ' tris-HCl, pH 8.3. 50 mmol L KC buffer 
works well for most PCR applications. The optimal Mg 
concentration, however, may need to be determined 
empirically for difficult target templates. The performance 
and fidelity of Taq DNA polymerase are sensitive o the 
free Mg^" concentration (Eckert and Kunkel 1990). 
which ionically interacts with not only the dNTPs but also 
with the primers, the template DNA, ethylenediaminete- 
traacetic acid (EDTA), and other chelating agents. In most 
cases, the M^^^ concentration will range from 1.0 to 
4.0 mmol L . . „.u 

The number of cycles and the cycle temperature/length 
of time for template denaturation and primer annealing 
and extension are important parameters for high-quality 
PCR results. The optimal number of cycles is dependent on 
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INTRODUCTION 

Since the discovery in 1989 of a second subtype (AT2) of angiotensin II (Ang II) 
receptors (1,2), its functional significance has remained a matter of debate. Expression of 
AT2 receptors was detected in adrenals, uterus, ovary, heart, brain, and at highest levels in 
the developing fetus. This subtype has been associated with reduction of intracellular cGMP 
levels (3-5), modulation of tyrosine phosphorylation (4, 6-9), inhibition of a T-type calcium 
channel (10. II), opening of potassium channels (12, 13), release of prostaglandin (14) and 
production of arachidonic acid (15). Recent molecular cloning of the AT2 receptor cDNA in 
rat (6, 16), mouse (17) and human (18, 19) revealed a structural organization typical of 
G-protein coupled receptors, but did not provide yet a clear understanding of transduction 
pathways and physiological functions associated with this subtype. 

To address the question of AT2 signaling, we have undertaken a detailed phanmacological. 
molecular and functional analysis of this receptor subtype in the murine neuroblastoma cell line 
N 1 E- 1 1 5 (9). These cells express a single molecular class of AT2 receptors and no detectable AT I 
binding site. In these cells. AT2 receptors are functionally coupled to protein tyrosine dephospho- 
rylation. The N I E- 1 1 5 cell line thus constitutes a simple and well-characterized cellular model for 
further analyses of AT2 receptor regulation and function. 



PHARMACOLOGICAL CHARACTERIZATION OF ANG II 
RECEPTORS IN N115 CELLS 

Subtypes (ATI and AT2) of Ang II receptors expressed in neuroblastoma NIE-115 
cells wereanalyzed using the non-selective iodinated Ang II antagonist : '"I-(Sar. Ile)-Angn, 
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Tabic 1. Binding parameters of various ligands (o membranes of N I E- 1 15 cells and COS-AT2 
cells. Values arc from Nalimias el al. (9). n.d. stands for non determined 







NlE-115 


COS-AT2 


Radioligand 


K.d(pM) 


Rmax(fmo!/nig) 


Kd(pM) Cmax(fniol/mg) 


'-M-(Sar,llc)AngII 


233±33 


298t53 


n.d n.d. 


'"I-CGP42I12 


91+19 


32l±42 


75±-t 125+11 






IC50 (nM) 


IC50(nM) 


(Sar,llc)Angll 




0.45tO.09 


0.39+0.07 


CGP42II2 




0.53±0.15 


0.45±0.08 


Angll 




1.22±0.12 


0.84+0.2K 


PD 123319 




4.60±0.21 


4.74+0.55 


DUP 753 




> 10.000 


> 10.000 



and the AT2-se!ective radioligand '"I-CGP 42112. Scatchard analyses revealed that both 
ligands bound to a single class of high afTmity binding sites with the same Bmax (Table I), 
indicating that most if not all Ang 11 receptors were of the AT2 subtype. The absence of the 
ATI subtype was confirmed by the inefTiciency of the ATI-selective ligand DUP 753 to 
compete for binding of '"I-(Sar, ne)-AngII (Table 1). In contrast, the AT2-seleciive ligands 
CGP 42 112 and PD 1233 19 were able to totally displace the binding of '-M-(Sar, I[e)-AngII 
in monophasic curves, indicating the presence of a single pharmacological class of AT2 
receptors, which binding parameters are presented in Table I. 

Fluharty and colleagues (20, 2 1 ) previously reported expression of both AT I and AT2 
receptor subtypes in NIE-1 15 cells. More recently, these authors identified two subpopula- 
tions of AT2 receptors (designated as peak I and peak III, respectively) that could be separated 
by heparine sepharose chromatography (22). These two populations differed in their affinity 
for the AT2-selective ligand PD 1 233 19, their sensitivity to DTTand to analogs of GTP, and 
their reactivity towards a polyclonal anli-AT2 receptor antiserum. Discrepancy between 
these data and ours may be explained by the divergence occuring in cell lines that were 
independently passaged for many years under different culture conditions. The AT2 subtype 
expressed in our cell line shows moderate affinity for PD 1233 19, increased binding in the 
presence of reducing reagent and insensitivity to GTPgS (9), and would thus correspond to 
the so-called "peak III". 



Cloning of AT2 receptor cDNA from NlE-115 

For a further molecular characterization, cDN A clones were isolated from a N 1 E- 1 1 5 
cDNA library by homology screening, using a DNA probe derived by a PGR strategy on the 
basis of (he rat AT2 receptor sequence (9). Nucleotide sequencing revealed that AT2 cDNA 
clones from NIE-1 15 belong to the same molecular subtype as those isolated from rat (16) 
and mouse (17) fetal tissues, as well as from rat pheochromocytoma PC12W cells (6), human 
myometrium (18) and human lung (19). 

One remarkable feature of the mouse AT2 cDNA sequence is the presence of two 
in-frame initiator ATG codons that potentially generate two polypeptides of different length 
(Fig. I). The downstream ATG codon is more likely utilized, as its position is in agreement 
with Kozak's consensus for initiation of translation (23), and corresponds to the single 
initiator ATG codon present in the rat (6, 1 6) and the human (18, 19) AT2 sequences. It cannot 
be excluded however that in certain conditions, translation is initiated at the upstream ATG 
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anding to these 30 amino acids did not afTeci 
mc omamg properties oi tne reccpior expressed in COS cells (data not shown). Translation 
of these additional residues may nevertheless affect the stabiliiy or the routing of the 
molecule inside the cell. 
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fndicated. Residues which differ between mouse and human AT2 sequences are m bold. 



The deduced amino-acid sequence of the mouse AT2 polypeptide (F.g. 2) displays 
all the features characteristic of G-proteir, coupled receptors (24) includ.ng Presenc-^^ 
seven membrarte-spar^ning domains, potential N-linked glycosylat.on s.tes. at^d consensus 
sites foTphosphorylation by protein kinases in the carboxy termmal ta.l. Sequence compan^ 
idicaSgh^onserva^^ 

• ATT ,J^„iorc ^Fit. 2\ Most diverecnt residues are clustered m the extracellular. 
ZZ^^^P^T:^'::^^^^!^^^ D.sU these differences. Hve putative N linked 
glycS'lation sites are conserved at identical positions in the ammo-termmal part of the AT2 

"'"'^hetlTed cDNA clones were functionally expressed in COS cells, yielding 
pharmalolog ct parameters identical to those of endogeneous Ang I -eptors frorn 
NIE 115 cells (Table I). This conf.rms thai NIE-US cells express exclus.vely the AT2 
receptor subtype, and identifies this receptor entity at the molecular level. 

FUNCTIONAL STUDIES OF AT2 RECEPTORS: COUPLING TO 
PROTEIN TYROSINE DEPHOSPHORYLATION 

The pharmacological and molecular ideniif.caiion of a single class of AT2 ^eceptors 
.nNlE-nScerprompfed us tofurtheranalyzethisreceptorsubtypeatthe functional level. 
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The relalionship between AT2 signaling and iniraccllular protein tyrosine phospliorylalion 
was investigated by Western blot analysis ofioial cell lysates using anii-phosphoiyrosine 
antibodies. Treatment of N I E-l 1 5 cells with Ang II led to a reduction in tyrosine phospho- 
rylation of several cndogeneous proleins, of apparent molecular masses 80, 97, 1 20, 1 50 and 
180 kDa, respectively (Fig. 3). This effect was rapid and iransicnt, showing a maximum sit 
5 to 10 minutes and declining at 30 minutes. This response could be blocked in ihe presence 
of an excess of the antagonist (Sar, Ilc)-Ang II and was also obiaincd following treatment 
wiih the AT2-selective agonist CGP 42 1 1 2 (9). 

Preliminary studies of Chinese Hamster Ovary (CHO) cells stably expressing the 
mouse AT2 receptor indicated that in this cellular model, AT2 stimulation also results in the 
transient dephosphorylalion of cellular proteins on tyrosine residues (data not shown). The 
same intracellular efTecl could not however be detected in COS cells transiently iransfected 
with either the murine or the human AT2 receptor cDNA. This may indicate that COS cells 
lack one or several components of Ihe AT2 signaling pathway. It is worth noting that other 
groups reporting the cloning of AT2 receptor cDNA failed to detect any intracellular effect 
of this receptor after both transient (16) and stable (6) overexpression in COS cells. 

Our results showing a reduction of intracellular protein tyrosine phosphorylation may 
be explained by stimulation of a protein tyrosine phosphatase (PTP) activity, or by inhibition 
of a tyrosine kinase-mediated pathway. Given the complexity of intracellular cascades of 
kinases and phosphatases (25-27), it is indeed possible that AT2 receptorsdifierentially affect 
several types of enzymes acting at different levels. In the PCI2W cell line, AT2 receptors 
have been reported to stimulate a PTP activity by a G-protein-independeni mechanism (4, 
7) and to inhibit a PTP through a pertussis toxin-mediated pathway (6. 8). Progress in 
understanding AT2 signaling will greatly benefit from investigating molecular determinants 
of AT2-mediated pathway, including identification of the G-protein(s) involved, search for 
cellular "interaction partners" of the receptor, elucidation of PTPs/TKs activities modulated 
by the AT2 subtype, and identification of cndogeneous phosphoproteins which are dephos- 
phorylated following AT2 receptor activation. 

The relationship between AT2 receptors and tyrosine phosphorylation suggests a link 
with physiological processes such as cell growth, differentiation or cellular adhesion, which 
are known to involve cascades of protein kinases and phosphatases (28). Possible role for 
AT2 receptors in these processes is also consistent with increased levels of expression of this 
subtype during embryonic development (29) and in pathological situations such as vascular 
growth (30) and tissue repair (31). In vivo studies have indeed identified AT2 receptors as 



B 

Figure 3. Ang Il-induccd tyrosine depliosphorylation in N1E-1I5 cells. (A) NIE-115 cells were incubated 
with Ang tl for various periods of lime, before lysis and subsequent immunoblotting with anti-phosphotyrosine 
antibodies. On the left are indicated the positions of standard molecular weight markers, and the arrow on the 
right shows the position of a major phosphoprotein of 80 kDa. (B) After removal of bound antibodies, the same 
blot was incubated with anti-czrin antibodies for internal control of the amount of proteins in each lane. 
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participating in vascular ncointiiua formadon (30) and angiogencsis (32) More recently, 
anti-proliferativc effects of AT2 receptors in primary cultures of rat endothelial coronary 
cells have been demonstrated (33). Other studies performed in cultured cells tend to indicate, 
however, inverse relationship between expression of AT2 receptors and cell proliferation 
(34-36). 



ARE THERE MULTIPLE SUBCLASSES OF AT2 RECEPTORS? 

Studies conducted in the rat brain have pointed out possible heterogeneity of AT2 
receptors. Quantitativcautoradiography revealed differential sensitivity of CGP 42 1 1 2-bind- 
ing sites lo analogs of GTP (37) and to reducing reagents (38, 39). In addition, imniunohis- 
fochemical staining of rat brain sections using anti-AT2 polyclonal antibodies correlated 
with, but was not identical to patterns of radioligand binding (40). As mentioned before, two 
pharmacologically, biochemically and immunologically distinct populations of AT2 recep- 
tors could be identified in N I E- 1 1 5 cells (22). These observations, as well as the diverse and 
sometimes contradictory reports of AT2 signaling, may be due to expression of a single 
molecular entity in different cell environments, and/or may reflect the existence of multiple 
subclasses of AT2 receptors. 

The AT2 receptor gene exists as a single copy in the genome of both human (1 8) and 
mouse (17). This argues against the possibility that several highly homologous AT2 genes 
exist, as is the case for rodent ATI a and ATI b. In addition, the coding region of the AT2 gene 
is devoid of introns. thus ruling out the hypothesis that alternative splicing may lead to 
multiple forms of receptors. It remains possible however that"AT2 receptors" in fact consist 
of several molecular entitites, that do not share high amino-acid sequence homology but that 
all exhibit high afTinity for ligands such as CGP 42 11 2 and PD 1 233 1 9 which define the AT2 
subtype pharmacologically. 

Further characterization of AT2 receptors in a variety of cellular environments is now 
made possible by the development of series of new ligands (41, 42) and may help discrimi- 
nate between putative AT2 receptor subtypes. The molecular basis of AT2 receptor hetero- 
geneity will also certainly be investigated in the coming years, as more and more AT2 cDNA 
clones will be isolated from different tissues and cell lines. 
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TYPE-2 ANGIOTENSIN II RECEPTOR (AT2) . 
P5D052 

P50052 GI:1703214 

swissprot: locus AG22_HUMAN, accession P50G52; 
class: standard. 

extra accessions : Q1301 6, created: Oct 1, 1996. 
sequence updated: Oct 1, 1996. 
annotation updated: Jul 15, 1998. 
xrefs: gi : 747969 , gi: 747970 , gi: 510700 , gi : 
gi: 607812 , gi : 558882 , gi : 558883 , gi : 595934 
1143833 , gi: 1143834 , gi: 860958 , gi : 860959 

xrefs (non-sequence databases): GCRDBGCR_1057 , GCRDBGCR_124 5 , 
GCRDBGCR_187 6, GCRDBGCR_201 1 , GCRDBGCR_2027 , GCRDBGCR_2031, 
GCRDBGCR_2056, MIM 300034 , PROSITEPS00237 

G- PROTEIN COUPLED RECEPTOR; TRANSMEMBRANE; GLYCOPROTEIN. 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; 
Primates; Catarrhini; Hominidae; Homo. 

1 (residues 1 to 363) 
Martin, M.M. and Elton, T.S. 
The sequence and genomic organ 
angiotensin II receptor 
Biochem. Biophys. Res. Commun . 
95251653 

SEQUENCE FROM N.A. 
TISSUE=LIVER 

2 (residues 1 to 363) 

Chassagne, C. , Beatty,B.G. and Meloche,S. 

Assignment of the human angiotensin II type 2 receptor gene (AGTR2) 
to. chromosome Xq22-q23 by fluorescence in situ hybridization 
Genomics 25 (2), 601-603 (1995) 
95309939 

SEQUENCE FROM N.A. 
TISSUE=PLACENTA 

3 (residues 1 to 363) 

Koike, G., Horiuchi,M., Yamada,T., Szpirer,C., Jacob, H.J. and 
Dzau,V. J. 

Human type 2 angiotensin II receptor gene: cloned, mapped to the X 
chromosome, and its mRNA is expressed in the human lung 
Biochem. Biophys. Res. Commun. 203 (3), 1842-1850 (1994) 
95032069 

SEQUENCE FROM N.A. 
TISSUE=BLOOD 

4 (residues 1 to 363) 

Tsuzuki,S., Ichiki,T., Nakakubo,H., Kitami,Y., Guo,D.F., Shirai,H. 
and Inagami,T. 

Molecular cloning and expression of the gene encoding human 
angiotensin II type 2 receptor 

Biochem. Biophys. Res. Commun. 200 (3), 1449-1454 (1994) 
94242007 

SEQUENCE FROM N.A. 
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5 {residues 1 to 363) 
Martin, M.M., Su,B. and Elton, T.: 

Molecular cloning of the human angiotensin II type 



recepto 



cDNA 



Biochem. Biophys . Res. Commun . 205 (1), 645-651 (1994) 
95091796 

SEQUENCE FROM N.A. 
TISSUE=LUNG 

6 (residues 1 to 363) 

Lazard,D., Briend-Sutren, M . M . , Villageois , P . , Mattel, M.G., 
Strosberg, A. D. and Nahinias,C. 

Molecular characterization and chromosome localization of a human 
angiotensin II AT2 receptor gene highly expressed in fetal tissues 
Recept. Channels 2 (4), 271-280 (1994) 
95236034 

SEQUENCE FROM N.A. 
TISSUE=PLACENTA 

7 (residues 1 to 363) 
KATSUYA,T. and DZAU,V.J. 
Direct Submission 

Submitted (~JAN-1996) TO EMBL/GENBANK/DDBJ DATA BANKS 

SEQUENCE OF 1-22 FROM N.A. 

TISSUE=BLOOD 

8 (residues 1 to 363) 

WARNECKE,C.H. , HOLZMEISTER, J . , REGITZ-ZAGROSEK, V. and FLECK, E. 
Direct Submission 

Submitted (~JUN-1995) TO EMBL/GENBANK/DDBJ DATA BANKS 

SEQUENCE OF 1-16 FROM N.A. 

TISSUE=UTERUS 

[FUNCTION] RECEPTOR FOR ANGIOTENSIN II. MAY HAVE A ROLE IN CELL 
MORPHOGENESIS AND RELATED EVENTS IN GROWTH AND DEVELOPMENT. 
[SUBCELLULAR LOCATION] INTEGRAL MEMBRANE PROTEIN. 

[TISSUE SPECIFICITY] IN ADULT, HIGHLY EXPRESSED IN MYOMETRIUM WITH 
LOWER LEVELS IN ADRENAL GLAND AND FALLOPIAN TUBE. VERY HIGHLY 
EXPRESSED IN FETAL KIDNEY AND INTESTINE. 

[SIMILARITY] BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 
Location/Qualifiers 
1. .363 

/organism="Homo sapiens" 
/db_xref="taxon: 9606" 
1..363 

/gene="AGTR2" 
2 1..363 

/gene="AGTR2" 

/product="TYPE-2 ANGIOTENSIN II RECEPTOR" 
1. .45 

/gene="AGTR2 " 
/region_name=" Domain" 
/ not e=" EXTRACELLULAR. " 



/gene="AGTR2 " 

/site_type="glycosylation" 



/gene="AGTR2 " 

/site_type="glycosylation" 



/gene="AGTR2" 

/site_type="glycosylation" 



/gene="AGTR2" 

/site_type="glycosylation" 



/gene="AGTR2 " 

/site_type="glycosylation" 



CBJ^Sequence Viewer 



46. .71 

/gene="AGTR2" 

/region_name= "Transmembrane region" 
/note="l . " 
72. .80 

/gene="AGTR2 " 
/region_name=" Domain" 
/note="CYTOPLASMIC . " 
81. . 102 
/gene="AGTR2" 

/region_name= "Transmembrane region" 

/note="2 . " 

103. .119 

/gene="AGTR2" 

/region_name=" Domain" 

/note="EXTRACELLULAR. " 

120. .140 

/gene="AGTR2 " 

/region narae="Transmembrane region" 
/note="3 . " 
141. .160 
/gene="AGTR2 " 
/region_name="Domain" 
/note="CYTOPLASMIC . " 
161 . .179 
/gene="AGTR2 " 

/region_name="Transmembrane region" 

/note="4 . " 

180. .208 

/gene="AGTR2" 

/region_name=" Domain" 

/not e=" EXTRACELLULAR. " 

209. .234 

/gene="AGTR2 " 

/region_name= "Transmembrane region" 

/note="5 . " 

235. .256 

/gene="AGTR2" 

/region_name=" Domain" 

/note="CYTOPLASMIC . " 

257. .278 

/gene="AGTR2 " 

/region_name="Transmembrane region" 

/note="6. " 

268 . .269 

/gene="AGTR2" 

/region_name="Conf lict " 

/note="CW -> WC (IN REF. 5)." 

272 

/gene="AGTR2" 
/region_name= "Conflict" 
/note="F -> L (IN REF. 4)." 
279. . 285 
/gene="AGTR2" 
/region_name=" Domain" 
/note="EXTRACELLULAR. " 
286. .313 
/gene="AGTR2" 

/region_name=" Transmembrane region" 
/note=;;7_. " 

3 14.. 3 63 

/gene="AGTR2" 
/region_name="Domain" ) 
/note="CYTOPLASMIC. " . . ' 
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Region 323 

/gene="AGTR2" 
/region_name= "Conflict" 
/note="N -> G (IN REF. 4)." 

[N 

1 mkgnstlatt sknitsglhf glvnisgnne stlncsqkps dkhldaipil yyiifvigfl 
61 vnivvvtlfc cqkgpkkvss iyifnlavad llllatlplw atyysyrydw Ifgpvmckvf 
121 gsfltlnmfa siffitcmsv dryqsviypf Isqrrnpwqa syivplvwcm aclsslptfy 
181 frdvrtieyl gvnacimafp pekyaqwsag ialmknilgf iiplifiatc yfgirkhllk 
241 tnsygknrit rdqvlkmaaa vvlafiicwl pfhvltflda lawmgvinsc eviavidlal 
301 pfaillgftn scvnpflycf vgnrfqqklr svfrvpitwl qgkresmscr kssslremet 
361 fvs 
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Show ] difference between I and II as |GenBank/GenPepr 

Revision history for P50052 



Gl 


Version 


Update Date 


Status 


1 


II 


1703214 


n/a 


heu o /LvUh o.oo aivi 









1703214 


n/a 


Jan zu zuu4 y. am 








1703214 


n/a 


NOV 1 1 ZUUo o.z 1 AM 


fT^d 

_£2 






1703214 


n/a 


Apr 1 2uU2 o.oZ rM 


Dead 






1703214 


n/a 


Jun ZUU1 1.1/ rM 


Dead 






1703214 


n/a 


Apr 16 2001 1.2o rM 


Dead 






1703214 


n/a 


Feb 1 ZUUI lU.ZO AM 


Dead 






1703214 


n/a 


Jun 12 2000 2:54 PM 


Dead 






1703214 




Apr 3 2000 11:14 AM 


Dead 


r 


f- 


1703214 


n/a 


Feb 7 2000 1:29 PM 


Dead 


r 


r 


1703214 


n/a 


Jan 31 2000 1:20 PM 


Dead 


r 


r 


1703214 


n/a 


Nov 4 1999 2:07 PM 


Dead 


r 


r 


1703214 


n/a 


Sep 21 1999 11:25 AM 


Dead 


r 


r 


1703214 


n/a 


Jul 21 1998 4:10 AM 


Dead 


r 


r 


1703214 


n/a 


Jun 15 1998 12:21 AM 


Dead 


r 


r 


1703214 


n/a 


Jun 4 1998 4:07 AM 


Dead 


r 


r 


1703214 


n/a 


Apr 10 1998 12:54 AM 


Dead 


r 


r 


1703214 


n/a 


Feb 4 1998 10:19 AM 


Dead 


r 


r 


1703214 


n/a 


Oct 9 1997 9:39 PM 


Dead 


r 


r 


1703214 


n/a 


Dec 3 1996 9:38 PM 


Dead 
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PRI 15-MAR-2004 



)CUS P50052 363 aa 

iFINITION Type-2 angiotensin II receptor (AT2) . 
XESSION P50052 
■:RSI0N P50052 GI: 1703214 

3S0URCE swissprot: locus AG22_HUMAN, accession P50052; 
class: standard. 

extra accessions :Q13016, created: Oct 1, 1996. 
sequence updated: Oct 1, 1996. 
annotation updated: Mar 15, 2004. 

xrefs: gi: 747969 , gi : 747970 , gi: 510700 , gi: 510701 , gi : 607811 
gi: 607812 , gi : 558882 , gi: 558883 , gi: 595934 , gi : 595935 , gi: 
32482004 , gi : 32482005 , gi: 1143833 , gi: 1143834 , gi : 860958 , gi: 
860959 , gi: 1082208 

xrefs (non-sequence databases) : HSSPP34996, GenewHGNC : 338 , MIM 

300034 , GO0005887, GO0004945, GO0004860, GO0006915, GO0007610, 

GO0007166, GO0008217, InterProIPR000276, PfamPFOOOOl, 

PRINTSPR00237, PROSITEPS00237 , PROSITEPS50262 
•:YW0RDS G-protein coupled receptor; Transmembrane; Glycoprotein; 

Polymorphism. 
)URCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; 

Mammalia; Eutheria; Primates; 

1 (residues 1 to 363) 
Martin, M.M. and Elton, T.S. 

The sequence and genomic organization of the hu: 
angiotensin II receptor 

Biochem. Biophys . Res. Commun. 209 (2), 554-562 (1995) 
95251653 
7733925 

SEQUENCE FROM N.A. 
TISSUE=Liver 

2 (residues 1 to 363) 

Chassagne,C., Beatty,B.G. and Meloche,S. 

Assignment of the human angiotensin II type 2 receptor gene (AGTR2) 
to chromosome Xq22-q23 by fluorescence in situ hybridization 
Genomics 25 (2), 601-603 (1995) 



QFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REMARK 

CFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REMARK 

•ZFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REMARK 

QFERENCE 
AUTHORS 



lata; Vertebrata; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



type : 



95309939 
7790004 

SEQUENCE FROM N.A. 
TISSUE=Placenta 

3 (residues 1 to 363) 

Koike, G., Horiuchi,M., Yamada,T., Szpirer,C., Jacob, H.J. and 
Dzau,V.J. 

Human type 2 angiotensin II receptor gene: cloned, mapped to the X 
chromosome, and its mRNA is expressed in the human lung 
Biochem. Biophys. Res. Commun. 203 (3), 1842-1850 (1994) 
95032069 
7945336 

SEQUENCE FROM N.A. 
TISSUE=Blood 

4 (residues 1 to 363) 

Tsuzuki,S., Ichiki,T., Nakakubo,H., Kitami,Y., Guo,D.F., Shirax,H. 
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JOURNAL 
MEDLINE 
PUBMED 
REMARK 

CFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 
PUBMED 
REMARK 

•:ference 

AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REMARK 

iFERENCE 
AUTHORS 
TITLE 
JOURNAL 
REMARK 

■:ference 

AUTHORS 
TITLE 
JOURNAL 
REMARK 

:ference 

AUTHORS 
TITLE 
JOURNAL 
REMARK 

)MMENT 



and Inagami,T. 

Molecular cloning and expression of the gene encoding human 
angiotensin II type 2 receptor 

Biochem. Biophys . Res. Commun. 200 (3), 1449-1454 (1994) 

94242007 

8185599 

SEQUENCE FROM N.A. 

TISSUE=Placenta 

5 (residues 1 to 363) 

Martin, M.M., Su,B. and Elton, T.S. 

Molecular cloning of the human angiotensin II type 2 receptor cDNA 

Biochem. Biophys. Res. Commun. 205 (1), 645-651 (1994) 

95091796 



7999093 



MATURES 

source 



SEQUENCE FROM N.A. 
TISSUE=Lung 

6 (residues 1 to 363) 

Lazard, D., Briend-Sutren, M . M . , Villageois, P. , Mattel, M.G., 
Strosberg, A. D. and Nahmias,C. 

Molecular characterization and chromosome localization of a human 

angiotensin II AT2 receptor gene highly expressed in fetal tissues 

Recept. Channels 2 (4), 271-280 (1994) 

95236034 

7719706 

SEQUENCE FROM N.A. 
TISSUE=Placenta 

7 (residues 1 to 363) 

Kopatz,S.A., Aronstam,R.S. and Sharma,S.V. 
Direct Submission 
Submitted (~JUN-2003) 
SEQUENCE FROM N.A. 

8 (residues 1 to 363) 
Katsuya,T. and Dzau,V.J. 
Direct Submission 
Submitted (~JAN-1996) 
SEQUENCE OF 1-22 FROM N.A. 
TISSUE=Blood 

9 (residues 1 to 363) 
Warnecke,C.H. , Holzmeister 
Direct Submission 
Submitted (~JUN-1995) 
SEQUENCE OF 1-16 FROM N.A. 
TISSUE=Uterus 

This SWISS-PROT entry is copyright. It is produced through a 
collaboration between the Swiss Institute of Bioinf ormatics and 
the EMBL outstation - the European Bioinf ormatics Institute. 
The original entry is available from http: / /www . expasy . ch/sprot 
and http: / /www . ebi . ac . uk/sprot 

[FUNCTION] Receptor for angiotensin II. May have a role in cell 
morphogenesis and related events in growth and development. 
[SUBCELLULAR LOCATION] Integral membrane protein. 

[TISSUE SPECIFICITY] In adult, highly expressed in myometrium wi 
lower levels in adrenal gland and fallopian tube. Very highly 
expressed in fetal kidney and intestine. 

[SIMILARITY] Belongs to family 1 of G-protein coupled receptors. 
Location /Qualifiers 
1. .363 

/organism="Homo sapiens" 
/db_xref="taxon: 9606" 
1..363 

/gene="AGTR2" 
1. .363 



Regitz-Zagrosek, V. and Fleck, E. 
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/gene="AGTR2" 

/product="Type-2 angiotensin II receptor 
1 . . 45 

/gene="AGTR2" 
/region_name= " Domain " 
/note="Extracellular (Potential) ." 
4 

/gene="AGTR2" 

/site_type="glycosylation" 
/note="N-linked (GlcNAc. . . ) (Potential). 
13 

/gene="AGTR2" 

/site_type="glycosylation" 
/note="N-linked (GlcNAc...) (Potential). 
24 

/gene="AGTR2" 

/site_type="glycosylation" 
/note="N-linked (GlcNAc...) (Potential). 
29 

/gene="AGTR2" 

/site_type="glycosylation" 
/note="N-linked (GlcNAc...) (Potential). 
34 

/gene="AGTR2 " 

/ site_type="glycosylation" 
/note="N-linked (GlcNAc...) (Potential). 
46. .71 

/gene="AGTR2" 

/region_name="Transmembrane region" 
/note="l (Potential)." 
72. .80 

/gene="AGTR2 " 

/ region_name=" Domain" 

/note="Cytoplasmic (Potential) ." 

81. .102 

/gene="AGTR2" 

/region_name= "Transmembrane region" 

/note="2 (Potential)." 

103. .119 

/gene="AGTR2" 

/ region_name="Domain" 

/note="Extracellular (Potential) ." 

120. .140 

/gene="AGTR2" 

/region_name= "Transmembrane region" 

/note="3 (Potential)." 

141 . .160 

/gene="AGTR2" 

/region_name=" Domain" 

/note="Cytoplasmic (Potential) ." 

161 . . 179 

/gene="AGTR2" 

/ re gion_name=" Transmembrane region" 

/note="4 (Potential)." 

180. .208 

/gene="AGTR2" 

/region_name=" Domain" 

/note="Extracellular (Potential) ." 

209. .234 

/gene="AGTR2" 

/ re gion_name=" Transmembrane region" 
/note="5 (Potential)." 
235. .256 
/gene="AGTR2" 
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/region_name=" Domain" 
/note="Cytoplasmic (Potential) ." 
Region 248 

/gene="AGTR2" 
/region_name="Variant " 

/note="R -> K (in dbSNP: 5191) . /FTId=VAR_011849 . " 
Region 257.. 278 

/gene="AGTR2" 

/region name="Transmembrane region" 
/note="6 (Potential)." 
Region 268 

/gene="AGTR2" 
/region_name= "Variant" 

/note="C -> W (in dbSNP: 1042860) . /FTId=VAR_011850 . " 
Region 269 

/gene="AGTR2" 

/region_name="Conf lict" 

/note="W -> C (in Ref. 5)." 
Region 272 

/gene="AGTR2" 

/region_name="Conf lict " 

/note="F -> L (in Ref. 4)." 
Region 279. .285 

/gene="AGTR2 " 

/region_name=" Domain" 

/note="Extracellular (Potential) ." 
Region 286. .313 

/gene="AGTR2" 

/region_name= "Transmembrane region" 

/note="7 (Potential)." 
Region 314. .363 

/gene="AGTR2" 

/region name=" Domain" 

/note="Cytoplasmic (Potential) ." 
Region 323 

/gene="AGTR2 " 

/region_name="Conf lict" 

/note="N -> G (in Ref. 4)." 

IN 

1 rakgnstlatt sknitsglhf glvnisgnne stlncsqkps dkhldaipil yyiifvigfl 
61 vnivvvtlfc cqkgpkkvss iyifnlavad llllatlplw atyysyrydw Ifgpvmckvf 
121 gsfltlnmfa siffitcmsv dryqsviypf Isqrrnpwqa syivplvwcm aclsslptfy 
181 frdvrtieyl gvnacimafp pekyaqwsag ialmknilgf iiplifiatc yfgirkhllk 
241 tnsygknrit rdqvlkmaaa vvlafiicwl pfhvltflda lawmgvinsc eviavidlal 
301 pfaillgftn scvnpflycf vgnrfqqklr svfrvpitwl qgkresmscr kssslremet 
361 fvs 
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1: P35374 . Type-2 angiotensi...[gi:543779] 



)CUS 

;finition 

;CESSION 

;rsion 

(SOURCE 



)URCE 
ORGANISM 



:ference 

AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
REMARK 

:ference 

AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
REMARK 

:ference 

AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
REMARK 

:ference 

AUTHORS 
TITLE 



JOURNAL 
MEDLINE 
REMARK 



linear 



ROD 15-JUL-1998 



P35374 363 aa 

TYPE-2 ANGIOTENSIN II RECEPTOR (AT2). \ 

P35374 , ^ 

P35374 GI:543779 

swissprot: locus AG22_MOUSE, accession P35374; 
class: standard, 
created: Jun 1, 1994. 
sequence updated: Jun 1, 1994. 
annotation updated: Jul 15, 1998. 

xrefs: gi : 455900 , gi : 455901 , gi : 439862 , gi : 439863 , gi : 443584 , 
gi- 474274 , gi: 487848 , , gi: 607834 , , gi : 543186 
xrefs (non-sequence databases): GCRDBGCR_08 90, GCRDBGCR_1007 , 
GCRDBGCR_1010, MGI87966, PROSITEPS00237 

G-PROTEIN COUPLED RECEPTOR; TRANSMEMBRANE; GLYCOPROTEIN; 

PHOSPHORYLATION. 

Mus musculus (house mouse) 

Mus musculus 

Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; 
Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

1 (residues 1 to 363) 

Nakajima,M., Mukoyama, M. , Pratt, R.E., Horiuchi,M. and Dzau,V.J. 
Cloning of cDNA and analysis of the gene for mouse angiotensin II 
type 2 receptor 

Biochem. Biophys . Res. Commun . 197 (2), 
94092107 

SEQUENCE FROM N.A. 
STRAIN=BALB/C; TISSUE=FETAL 

2 (residues 1 to 363) 

Ichiki,T., Herold,C.L., Kambayashi , Y . 
Cloning of the cDNA and the genomic DNA of the 
type 2 receptor 

Biochim. Biophys. Acta 1189 (2), 247-250 (1994) 
94122216 

SEQUENCE FROM N.A. 
STRAIN=BALB/C 

3 (residues 1 to 363) 

Nahmias,C., Cazaubon, S . M . , Sutren,M., Masson,M., 
Villageois, P. , Elbaz,N. and Strosberg,A.D. 
Molecular and functional characterization of angiotens 
receptor in neuroblastoma NlE-115 cells 
Adv. Exp. Med. Biol. 396, 167-173 (1996) 
96337434 

SEQUENCE FROM N.A. 

4 (residues 1 to 363) 

Horiuchi,M., Koike, G., Yamada,T., Mukoyama, M., Nakajima,M. 
Dzau,V. J. 

The growth-dependent expression of angiotensin II type 
is regulated by transcription factors interferon regulato 
factor-1 and -2 

J. Biol. Chem. 270 (34), 20225-20230 (1995) 
95378283 

SEQUENCE FROM N.A. 
STRAIN=BALB/C; TISSUE=LIVER 



393-399 (1993) 



Bardhan,S. and Inagami,T. 

mouse angiotensin 



Lazard, D. 



II AT2 



recepto 
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MATURES 

source 



[FUNCTION] RECEPTOR FOR ANGIOTENSIN II. MAY HAVE A ROLE IN CELL 
MORPHOGENESIS AND RELATED EVENTS IN GROWTH AND DEVELOPMENT. 
[SUBCELLULAR LOCATION] INTEGRAL MEMBRANE PROTEIN. 
[TISSUE SPECIFICITY] ABUNDANT IN FETUS, LOWER LEVELS IN ADULT 
BRAIN. 

[PTM] CARBOXYL-TERMINAL SER OR THR RESIDUES MAY BE PHOSPHORYLATED . 
[SIMILARITY] BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 

Location/Qualifiers 

1. .363 

/organisin="Mus musculus" 
/db_xref="taxon: 10090" 
1. .363 

/gene="AGTR2" 
T 1..363 

/gene="AGTR2" 

/product="TYPE-2 ANGIOTENSIN II RECEPTOR" 
1. .45 

/gene="AGTR2" 
/region_name=" Domain" 
/note="EXTRACELLULAR. " 
4 

/gene="AGTR2" 

/site_type="glycosylation" 
13 

/gene="AGTR2 " 

/site_type="glycosylation" 
24 

/gene="AGTR2" 

/site_type="glycosylation" 
29 

/gene="AGTR2 " 

/site_type="glycosylation" 
34 

/gene="AGTR2 " 

/site type="glycosylation" 
4 6.. 71 

/gene="AGTR2" 

/region_name="Transmembrane region" 

/note="l . " 
1 72. .80 

/gene="AGTR2" 

/region name="Domain" 

/note="CYTOPLASMIC . " 
1 81. .102 

/gene="AGTR2" 

/region_name="Transmembrane region" 

/note="2. " 
1 103. .119 

/gene="AGTR2" 

/region name="Domain" 

/note="EXTRACELLULAR. " 
T 120. .140 

/gene="AGTR2" 

/region_name="Transmembrane region" 

/note="3. " 
Q 141. .160 

/gene="AGTR2 " 

/region_narae=" Domain" 

/note="CYTOPLASMIC. " 
n 161. .179 

/gene="AGTR2" 

/region_name="Transmembrane region" 
/note="4 . " 
n 180. .208 
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/gene="AGTR2" 
/region_name=" Domain" 
/note="EXTRACELLULAR. " 
Region 209.. 234 

/gene="AGTR2" 

/region name="Transmembrane region" 

/note="5 . " 
Region 235. .256 

/gene="AGTR2" 

/region_name=" Domain" 

/note="CYTOPLASMIC. " 
Region 257. .278 

/gene="AGTR2" 

/region_name="Transmembrane region" 

/note="6." 
Region 279. .285 

/gene="AGTR2" 

/region_name=" Domain" 

/note="EXTRACELLULAR. " 
Region 286. .313 

/gene="AGTR2" 

/region_name="Transmembrane region" 

^_ ,.. „/nQt,e="7_. " 

Region 314.. 363 

/gene="AGTR2" ^-.^^ 
/region_name="Domain" 

- Loote^XTLQSJu^SMi&r-'^ — -" 

Site 354 

/gene="AGTR2" 

/site_type="phosphorylation" 
/note=" (BY PKC) . " 

5IN . ^ . ^ 

1 mkdnfsfaat srnitssrpf dnlnatgtne safncshkps dkhleaipvl yymifvxgfa 
61 vnivvvslfc cqkgpkkvss iyifnlalad llllatlplw atyysyrydw Ifgpvmckvf 
121 gsfltlnmfa siffitcmsv dryqsviypf Isqrrnpwqa syvvplvwcm aclsslptfy 
181 frdvrtieyl gvnacimafp pekyaqwsag ialmknilgf iiplifiatc yfgirkhllk 
241 tnsygknrit rdqvlkmaaa vvlafiicwl pfhvltflda Itwmgiinsc eviavidlal 
301 pfaillgftn scvnpflycf vgnrfqqklr svfrvpitwl qgkretmscr kgsslremdt 
361 fvs 
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ORGANISM 



iFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REMARK 

:ference 

AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REMARK 

:ference 

AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REMARK 

:ference 

AUTHORS 
TITLE 



P35374 363 aa linear ROD 15-MAR-2004 

Type-2 angiotensin II receptor (AT2) . 

P35374 

P35374 GI:543779 

swissprot: locus AG22_MOUSE, accession P35374; 

class: standard. 

created: Jun 1, 1994. 

sequence updated: Jun 1, 1994. 

annotation updated: Mar 15, 2004. 

xrefs: gi : 455900 , gi : 455901 , gi : 439862 , gi: 439863 , gi: 443584, 
gi: 474274, gi: 487848 , gi: 1881537 , gi : 607834, gi: 607836 , gi: 
13277866 , gi : 13277867 , gi: 2137148 

xrefs (non-sequence databases) : HSSPP34996, MGI87966, 
InterProIPR000276, PfamPFOOOOl, PRINTSPR00237 , PROSITEPS00237 , 
PROSITEPS50262 

G-protein coupled receptor; Transmembrane; Glycoprotein; 

Phosphorylation. 

Mus musculus (house mouse) 

Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (residues 1 to 363) 

Mukoyama,M., Pratt, R.E., Horiuchi,M. and Dzau,V.J. 



angiotensin II 



Nakajima,M. . 

Cloning of cDNA and analysis of the gene for : 
type 2 receptor 

Biochem. Biophys. Res. Coitimun . 197 (2), 393-399 (1993) 

94092107 

8267573 

SEQUENCE FROM N.A. 
STRAIN=BALB/c; TISSUE=Fetal 

2 (residues 1 to 363) 

Ichiki,T., Herold,C.L., Kambayashi , Y . , Bardhan,S. and Inagami,T. 
Cloning of the cDNA and the genomic DNA of the mouse angiotensin II 
type 2 receptor 

Biochim. Biophys. Acta 1189 (2), 
94122216 
8292631 

SEQUENCE FROM N.A. 
STRAIN=BALB/c 

3 (residues 1 to 363) 
Nahmias,C., Cazaubon 



247-250 (1994) 



Sutren,M., Masson,M., Lazard,D., 
Villageois, P. , Elbaz,N. and Strosberg,A.D. 
Molecular and functional characterization of angiotensin II AT2 
receptor in neuroblastoma NlE-115 cells 
Adv. Exp. Med. Biol. 



167-173 (1996) 



396 

96337434 
8726696 

SEQUENCE FROM N.A. 
4 (residues 1 to 363) 
Horiuchi,M., Koike, G., Yamada, 
Dzau,V. J. 

The growth-dependent expressio 

is regulated by transcription factors interferon regulatory 



Mukoyama,M., Nakajima,M. and 
of angiotensin II type 2 receptor 
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TITLE 

JOURNAL 
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REMARK 

)MMENT 



factor-1 and -2 

J. Biol. Chem. 270 (34), 20225-20230 (1995) 

95378283 

7650042 

SEQUENCE FROM N.A. 
STRAIN=BALB/c; TISSUE=Liver 
5 (residues 1 to 363) 

Strausberg, R. L. , Feingold, E . A . , Grouse, L.H., Derge,J.G., 
Klausner, R. D. , Collins, F. S . , Wagner, L., Shenmen, C . M . , Schuler , G . D . , 
Altschul, S. F. , Zeeberg,B., Buetow,K.H., Schaef er , C . F. , Bhat,N.K., 
Hopkins, R. F. , Jordan, H., Moore, T., Max,S.I., Wang, J., Hsieh,F., 
Diatchenko, L . , Marusina,K., Farmer, A. A., Rubin, G.M., Hong,L., 
Stapleton, M. , Scares, M.B., Bonaldo, M. F. , Casavant, T.L. , 
Scheetz, T.E. , Brownstein, M. J. , Usdin,T.B., Toshiyuki , S . , 
Carninci,P., Prange,C., Raha,S.S., Loquellano, N . A . , Peters, G. J., 
Abramson, R. D. , Mullahy, S . J. , Bosak,S.A., McEwan,P.J., 
McKernan,K. J. , Malek,J.A., Gunaratne , P . H . , Richards, S., 
Worley,K.C., Hale,S., Garcia, A.M., Gay,L.J., Hulyk,S.W., 
Villalon, D.K. , Muzny,D.M., Sodergren, E . J. , Lu,X., Gibbs,R.A., 
Fahey,J., Helton, E., Ketteman, M. , Madan,A., Rodrigues , S . , 
Sanchez, A., Whiting, M., Madan,A., Young, A. C, Shevchenko, Y. , 
Bouf fard, G.G. , Blakesley, R. W. , Touchman, J. W. , Green, E.D., 
Dickson, M. C. , Rodriguez, A. C. , Grimwood,J., Schmutz,J., Myers, R.M., 
Butterfield,Y.S.N., Krzywinski, M. I . , Skalska,U., Smailus , D . E . , 
Schnerch,A., Schein,J.E., Jones, S. J. M. and Marra,M.A. 
Generation and initial analysis of more than 15,000 full-length 
human and mouse cDNA sequences 

Proc. Natl. Acad. Sci . U.S.A. 99 (26), 16899-16903 (2002) 

22388257 

12477932 

SEQUENCE FROM N.A. 

STRAIN=C57BL/6J; TISSUE=Mammary gland 

This SWISS-PROT entry is copyright. It is produced through a 
collaboration between the Swiss Institute of Bioinf ormatics and 
the EMBL outstation - the European Bioinf ormatics Institute. 
The original entry is available from http: //www. expasy. ch/sprot 
and http : / /www . ebi . ac . uk/sprot 

[FUNCTION] Receptor for angiotensin II. May have a role in cell 
morphogenesis and related events in growth and development. 

[SUBCELLULAR LOCATION] Integral membrane protein. 

[TISSUE SPECIFICITY] Abundant in fetus, lower levels in adult 
brain . 

[PTM] Carboxyl-terminal Ser or Thr residues may be phosphorylated . 
[SIMILARITY] Belongs to family 1 of G-protein coupled receptors. 

Location /Qualifiers 
! 1..363 

/organism="Mus musculus" 

/db_xref="taxon: 10090" 

1. .363 

/gene="AGTR2" 
.n 1..363 

/gene="AGTR2" 

/product="Type-2 angiotensin II receptor" 
1 1. . 45 

/gene="AGTR2" 
/region_name=" Domain" 
/note="Extracellular (Potential) ." 




/gene="AGTR2" 
/site_type="glycosylation' 
/note="N-linked (GlcNAc. . , 



) (Potential) . 
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/gene="AGTR2 " 

/site_type="glycosylation" 
/note="N-linked (GlcNAc...) (Potential). 
24 

/gene="AGTR2" 

/site_type="glycosylation" 
/note="N-linked (GlcNAc...) (Potential). 
29 

/gene="AGTR2 " 

/site_type="glycosylation" 
/note="N-linked (GlcNAc.) (Potential). 
34 

/gene="AGTR2" 

/site_type="glycosylation" 
/note="N-linked (GlcNAc...) (Potential). 
46. .71 

/gene="AGTR2" 

/region_name= "Transmembrane region" 
/note="l (Potential)." 
72. .80 

/gene="AGTR2" 
/region_name="Domain" 
/note="Cytoplasmic (Potential) ." 
81. .102 
/gene="AGTR2" 

/region_name= "Transmembrane region" 

/note="2 (Potential)." 

103. .119 

/gene="AGTR2" 

/region_name=" Domain" 

/note="Extracellular (Potential) ." 

120. .140 

/gene="AGTR2" 

/region_name= "Transmembrane region" 
/note="3 (Potential)." 
141. .160 
/gene="AGTR2 " 
/region_name=" Domain" 
/note="Cytoplasmic (Potential) ." 
161 . .179 
/gene="AGTR2 " 

/region_name=" Transmembrane region" 

/note="4 (Potential)." 

180. .208 

/gene="AGTR2" 

/region_name=" Domain" 

/note="Extracellular (Potential) ." 

209. .234 

/gene="AGTR2" 

/region_name=" Transmembrane region" 

/note="5 (Potential)." 

235. .256 

/gene="AGTR2" 

/region_name=" Domain" 

/note="Cytoplasmic (Potential) ." 

257. .278 

/gene="AGTR2" 

/region_name=" Transmembrane region" 

/note="6 (Potential)." 

279. .285 

/gene="AGTR2" 

/region_name=" Domain" 

/note="Extracellular (Potential) ." 

286. .313 
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/gene="AGTR2 " 

/region_name=" Transmembrane region" 

/note="7 (Potential)." 
Region 314. .363 

/gene="AGTR2" 

/ reg i on_name= " Doma in " 

/note="Cytoplasmic (Potential) ." 
Site 354 

/gene="AGTR2 " 

/ si te_type= "phosphorylation" 
/note="(BY PKC) (POTENTIAL)." 

N 

1 mkdnfsfaat srnitssrpf dnlnatgtne safncshkps dkhleaipvl yymifvigfa 
61 vnivvvslfc cqkgpkkvss iyifnlalad llllatlplw atyysyrydw Ifgpvmckvf 
121 gsfltlnmfa siffitcmsv dryqsviypf Isqrrnpwqa syvvplvwcm aclsslptfy 
181 frdvrtieyl gvnacimafp pekyaqwsag ialmknilgf iiplifiatc yfgirkhllk 
241 tnsygknrit rdqvlkmaaa vvlafiicwl pfhvltflda Itwmgiinsc eviavidlal 
301 pfaillgftn scvnpflycf vgnrfqqklr svfrvpitwl qgkretmscr kgsslremdt 
361 fvs 
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IDENTIFIERS 



dbEST Id: 5909367 

EST name: UI-M-AMO-adp-b-06-O-UI . rl 

GenBank Acc: BE652532 

GenBank gi : 9978333 



CLONE INFO 

Clone Id: UI-M-AMO-adp-b-06-O-UI (5') 

Source: The NIH - University of Iowa Brain Molecular Anatomy 

Project: NIH-Iowa BMAP (Bento Soares, Thomas Casavar 
Val Sheffield) 

Id as DNA: UI-M-AMO-adp-b-06-O-UI 

Id in host: UI-M-AMO-adp-b-06 . rl 

DNA type: cDNA 



PRIMERS 

Sequencing: M13 Reversi 

PolyA Tail: Unknown 



GCTGCTTCAAGCGCCTGTGAGAAGCTAGAAAAGGCTAGGACTGACTTACAGJ 
CAAGAATTTGTCCAGAAACTAAACCAGCAGCATCAGACAGACCGGACGGAAC 
CGGCTGAAGGACTTATACACCGCAGAGTGTGAGAAGCTTCAGAGCATTTACJ 
GCAGAAAAATATAAAACTCAACTGCAAGAGCAGTTTGACAACTTAAACGCCC 
ACCACTAAGCTTGAGATTGAAGCTAGCCACTCGGAGAAGGTGGAATTGCTG/ 
TATGAAACCTCCCTTTCAGAAATCAAGAAGAGCCATGAGATGGAGAAGAAGT 
GATCTGCTTAATGAGAAGCAGGAATCGCTGGAGAAACAAATC/y^TGATCTG/ 
AACGATGCTTTAAACGAAAGGTTGAAATCAGAGGAGCAAAAGCAACTGTCA/ 
GCGAATTCAAA7VAACCCTCAGGTCATGTATCTGGAGC 



Entry Created: Sep 6 2000 
Last Updated: Sep 6 2000 



COMMENTS 

cDNA Library Preparation: M.B. Soares Lab Clone 
distribution: Researchers may obtain BMAP cDNA clone 
RESEARCH GENETICS. It should be noted that Bento So£ 
generating a small number of additional specialized 
non-redundant arrays of BMAP cDNAs whose availabilit 
be considered under appropriate and limited collaboi 
arrangements 



LIBRARY 

Lib Name: 



NIH BMAP MAM 



strain : 

Develop, stage: 

Lab host: 

Vector : 

R. Site 1: 

R. Site 2: 

Descript ion : 



C57BL/6J 
27-32 days 

DHIOB (Life Technologies) 

pT7T3D-Pac (Pharmacia) with a modified polylinker 
Not I 
Eco RI 

The NIH_BMAP_MAM library is a non-normalized librar:> 
constructed from mouse amygdala. The tag is a strinc 
nucleotides present between the Not I site and the c 
track. The library was constructed as described by E 
Lennon and Scares, Genome Research 6: 791-806, 1996. 
provided by Ms. Annie Novakovich, Zivic-Miller Laboi 



SUBMITTER 

Name : 

Institution : 
Address : 

Tel : 
Fax: 
E-mai 1 : 



Chin, H 

National Institute of Mental Health 

6001 Executive Blvd. Room 7N-7190, MSG 9643, Bethesc 
20892-9643, USA 
301 443 1706 
301 443 9890 
mEST@mail . nih . gov 



CITATIONS 

Medline UID: 
Title : 

Authors : 
Citation : 



97044477 

Normalization and subtraction: two approaches to fac 
gene discovery 

Bonaldo, M. F. , Lennon, G., Scares, M.B. 
Genome Res. 6 (9): 791-806 1996 



MAP DATA 
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Entrez 

Search for Genes 

LocusLink provides 
curated information for 
human, fruit fly, mouse, 
rat, and zebrafish 



Gl 


Version 


Update Date 


Status 


9978333 


1 


Sep 6 2000 6:24 PM 


Live 



6:24 PM 
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Normalization and subtraction: two approaches to facilitate gene 
discovery. 

Bonaldo MF, Lennon G, Scares MB. 

Department of Psychiatry, College of Physicians and Surgeons of Columbia University, 
New York, New York, USA. 

Large-scale sequencing of cDNAs randomly picked from libraries has proven to be a 
very powerful approach to discover (putatively) expressed sequences that, in. turn, once 
mapped, may greatly expedite the process involved in the identification and cloning of 
human disease genes. However, the integrity of the data and the pace at which novel 
sequences can be identified depends to a great extent on the cDNA libraries that are 
used. Because altogether, in a typical cell, the mRNAs of the prevalent and intermediate 
frequency classes comprise as much as 50-65% of the total mRNA mass, but represent 
no more than 1000-2000 different mRNAs, redundant identification of mRNAs of these 
two frequency classes is destined to become overwhelming relatively early in any such 
random gene discovery programs, thus seriously compromising their cost-effectiveness. 
With the goal of facilitating such efforts, previously we developed a method to construct 
directionally cloned normalized cDNA libraries and applied it to generate infant brain 
(INIB) and fetal liver/spleen (INFLS) libraries, from which a total of 45,192 and 86,088 
expressed sequence tags, respectively, have been derived. While improving the 
representation of the longest cDNAs in our libraries, we developed three additional 
methods to normalize cDNA libraries and generated over 35 libraries, most of which 
have been contributed to our integrated Molecular Analysis of Genomes and Their 
Expression (IMAGE) Consortium and thus distributed widely and used for sequencing 
and mapping. In an attempt to facilitate the process of gene discovery further, we have 
also developed a subtractive hybridization approach designed specifically to eliminate 
(or reduce significantly the representation of) large pools of arrayed and (mostly) 
sequenced clones from normalized libraries yet to be (or just partly) surveyed. Here we 
present a detailed description and a comparative analysis of four methods that we 
developed and used to generate normalize cDNA libraries from human (15), mouse (3), 
rat (2), as well as the parasite Schistosoma mansoni (1). In addition, we describe the 
construction and preliminary characterization of a subtracted liver/spleen library 
(INFLS-SI) that resulted from the eliminafion (or reducfion of representation) of -5000 
INFLS-IMAGE clones from the INFLS library. 

PMID: 8889548 [PubMed - indexed for MEDLINE] 
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IDENTIFIERS 

dbEST Id: 

EST name: 
GenBank Acc : 
GenBank gi : 



1607105 

vx39f05.rl 

AA880300 

2989283 



CLONE INFO 

Clone Id: 
Source : 
DNA type: 



IMAGE: 1277601 (5') 
IMAGE Consortiuin, LLNL 
cDNA 



PRIMERS 

Sequencing : 
PolyA Tail: 



-28ml3 revl ET from Amersham 
Unknown 



SEQUENCE 



Quality: 



CTTTCCCGTGGTAACACCAAGTTTGAAGCGCTGACAGTTGTGATCCAGCACC 
GAGCGGGAGGAAGCACTGAAGCAACACAAAACCCTCTCTCAAGAACTTGTCJ 
GGAGAGCTAGTTGCTGCTTCAAGCGCCTGTGAGAAGCTAGAAAAGGCTAGGC 
CAGACAGCGTATCAAGAATTTGTCCAGAAACTAAACCAGCAGCATCAGACAC 
GAACTGGAGAACCGGCTGAAGGACTTATACACCGCAGAGTGTGAGAAGCTTC 
TACATTGAGGAGGCAGAAAAATATAAAACTCAACTGCAAGAGCAGTTTGAC; 
GCCGCCCATGAGACCACTAAGCTTGAGATTGAAGCTAGCCACTCGGAGAAGC 
CTGAAGAAGACCTATGAAACCTCCCTTTCAGAAATCAAGAAGAGCCATGAG/ 
AAGTCACTGGAGGATCTGCT 

High quality sequence stops at base: 464 



Entry Created: Mar 26 199 
Last Updated: Mar 26 199 



This clone is available royalty-free through LLNL ; 
the IMAGE Consortium (info@image.llnl.gov) for furth 
information . 
MGI : 669401 



LIBRARY 

Lib Name: 
Organism: 
Strain : 
Sex : 
Organ : 

Tissue type: 



Stratagene mouse lung 937302 

Mus musculus 

C57BL/6 X CBA 

female 

lung 

lung 



Vector : 
R. Site 1: 
R. Site 2: 
Description : 



pBluescript SK- 

EcoRI 

Xhol 

Cloned unidirectionally . Primer: Oligo dT. 6-8 montl- 
female lung and 1.5 year old male lung were source c 
Average insert size: 1.5 kb; Uni-ZAP XR Vector; ~5' 
sequence: 5' GAATTCGGCACGAG 3' -3' adaptor sequence: 
CTCGAGTTTTTTTTTTTTTTTTTT 3' 



SUBMITTER 

Name : 
Lab: 

Institution : 
Address : 
Tel : 
Fax : 
E-mail : 



Marra M/Mouse EST Project 

WashU-HHMI Mouse EST Project 

Washington University School of MedicineP 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 6;- 

314 286 1800 

314 286 1810 

mouseest@watson.wustl.edu 



CITATIONS 

Title: 
Authors : 



Year: 
Status : 



The WashU-HHMI Mouse EST Project 

Marra, M., Hillier,L., Allen, M., Bowles, M., Dietrich, 
Dubuque, T., Geisel,S., Kucaba,T., Lacy,M., Le,M., Ma 
Morris, M., Schel lenberg , K . , Steptoe,M., Tan,F., Unde 
, Moore, B., Theising,B., Wylie,T., Lennon,G., Soares 
Wilson, R., Waterston,R. 
1996 

Unpublished 
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rat, and zebrafish 



Gl 


Version 


Update Date 


status 


2989283 


1 


Mar 27 1998 12:37 AM 


Live 



Accession AA880300 was first seen at NCBI on Mar 27 1998 
12:37 AM 
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Blast 2 Sequences results 

PubMed Entrez BLAST OMIM Taxonomy Structure 

LAST 2 SEQUENCES RESULTS VERSION BLASTN 2.2.6 [Apr-09-2003] 

latchipT' Mismatch:F2^ gap openjs gap extension: p 
dropoff: l50 expect :(lO OOOi wordsize: jl1 Filter F . Align | 

'"^"""^^ lcl|seq_l 
gi 

2989283 IMAGE: 1277601 5'. 



Length 354 
Length 500 




(1 .. 
354) 



500) 



OTE:The statistics (bitscore and expect value) is calculated based on the size of nr database 

OTE:If protein translation is reversed, please repeat the search with reverse strand of the query sequence 



:ore = 535 bits (278), Expect = e-149 
lentities = 278/278 (100%) 
5trand = Plus / Plus 



lery: 1 catcagacagaccggacggaactggagaaccggctgaaggacttatacaccgcagagtgt 60 

I I I I I I I I I I I I I I I I I I 1 11 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3jct; 223 catcagacagaccggacggaactggagaaccggctgaaggacttatacaccgcagagtgt 282 

lery: 61 gagaagcttcagagcatttacattgaggaggcagaaaaatataaaactcaactgcaagag 120 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
)jct: 283 gagaagcttcagagcatttacattgaggaggcagaaaaatataaaactcaactgcaagag 342 

lery: 121 cagtttgacaacttaaacgccgcccatgagaccactaagcttgagattgaagctagccac 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
)jct: 343 cagtttgacaacttaaacgccgcccatgagaccactaagcttgagattgaagctagccac 402 
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Urge-scale sequencing of cDNAs randomly picked from libraries has proven to be a very P<>^;f;/"' f P^^^"^^ 
to discover (pucatively) expressed sequences that, in turn, oftce mapped, may greatly expedite the process 
Involved in the Identification and doning of human disease genes. However, the .ntegr.ty of the data and the 
pace at which novel sequences can be Identified depends to a great extent on die cDNA libiarles that are 
used. Because altogedier, in a typical cell, the mRNAs of the prevalent and '"^^""^'I'f!^^"'^.^^^^^ 
comprise as much as 50-^5% of the total mRNA mass, but represent no more than 1000-2000 different 
mRNAs, redundant identification of mRNAs of these two frequency classes is desuned to become 
overwhelming relatively early in any such random gene discovery programs, thus seriously compromising 
their cost^ffectlveness. With the goal of facilitating such efforts, previously we developed a method to 
construct direalonally doned normalized cDNA libraries and appfied it to generate Infant brain (INIB) and 
fetal liver/spleen (INFLS) libraries, from Whidi a total of 45,192 and 86,088 ^^pr^sed sequence Qgs, 
respectwely, have been derived. While Improving the representation of the longest cDls^ In our libraries, 
we developed three additional methods to normalize cDNA libraries and generated over 35 Mbrarles, most of 
which have been contributed to our Integrated Molecular Analysis of Genomes and Their Expression 
(IMAGE) Consortium and thus distributed widely and iised for sequencing and mapping In an attempt to 
facilitate the process of gene discovery further, we have also developed a subtractive hybridization approach 
designed specifically to eliminate (or reduce significantly the representation of} large pools of arrayed and 
(mostly) sequenced clones from normaliied libraries yet to be (or just partly) surveyed Here we Pr«ent a 
detailed description and a comparative analysis of four methods that we developed and used to generate 
normalize cDNA libraries from human 05), mouse (3), rat (2), as well as the parasite Schktosoma mansoni m. In 
addition, we describe the construction and preliminary characterization of * subtra^ed liver /spleen 
(INFLS^l) that resulted from die elimination (or reduction of representation) of -5000 INFLS^IMAGE clones 
from the INFLS library. 

Large-scale single-pass sequencing of cDNA cally unpak the oveiall efficiency of the ap- 

ciones randomly picked from libraries has proven proach. but also seriously compromise the integ- 

to be a powerful approach to discover genes (Ad- rlty of the data that axe generated. Among such 

-„ns et al 1<>91, 1993a,b, 1995; Khan et al. 1992; iunky clones are: (1) clones tliat consist exdu- 

MeComble el al. 1992; Okubo ct aL 1992; Mat- sively of poly{A) tails of mRNAs; (2) clones that 

subara and Okubo 1993; see also Hillier et al., this contain very short cDNA inserts; (3) fo"" J^'a^ 

issiic) However, the significance of using cDNA contain nothing but the 3' half of the Nofl- 

librafies thai are well suited for this purpose oligoCd'Ois pflmer used for synthesis of tiist- 

should not be underestimated (Adams et al. strand cDNA ligated to an adaptor; and (4) chl- 

ig^^M merle clones, I.e., cDNAs derived from different 

Ordinary cDNA libraries may contain a high niRNAs joined artifactually during ligation . Fur- 

frequency of undesirable ("iunky") clones (Ad- thcimore, given that, as a general rule, the fre- 

aml et al. 1991, 1992) that may not only drasti- quency of occurrence of a cDNA clone m a library 

15 equivalent to that of Its corresiK>ndtng raRNA 
in the cell, even higli-quaUly cDNA libraries may 
ES?SSS2rf««'«lumb.a..d„: rxx (»2) 7«,.M77. not be ideal for large-scale sequencing. 
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Roassociation-kinctics analysis indicates that 
the uiRNAs of a typical somatic cell are distrib- 
uted in tJiiee frequency classes; (1) superpreva- 
lent (consisting of about 10-15 mRNAs that alto- 
gether represent 10-20% of the total mRNA 
mass); (2) intermediale (10CX)-2000 mRNAs; 40- 
45%); and (3) complex (15,000-20,aX) niRNAs; 
40-45%) (Bishop et al. 1974; Davidson and Brit- 
ten 1979). Accordingly, once most mRNAs of the 
prevalent and intermediate frequency classes are 
Irlmtificd, rcdimdancy levels are expected to be- 
couie greater than 60%. For this reason, the use 
of normalized libraries, in which the frequency of 
all doncs is within a narrow range (Scares et al. 
1994), has been shown to be beneficial for large- 
scale sequencing (Berry et al. 1995; Houlgatte et 
al. 1995). Calculations show that at Cot = 5.5 
[where is the total DNA concentration and t is 
the time (moles nucleotides per liler x sec)], of 
the three kinetic classes of mRNAs, the most 
abundant species are diminished drastically, 
while all frequencies are brought within the 
range of one order of magnitude (Soares et al. 
1994). 

However, because a large fraction of all hu- 
man genes has been identified already, redun- 
dant identificaUon of genes that are expressed in 
multiple tissues cannot be avoided simply by the 
use of aoraialijied libraries. Hence, we argue that 
the use of subtractive cUNA libraries enriched for 
genes expressed at low levels and that have not 
yet been identified should become Increasingly 
more advantageous for large-scale sequencing 
programs. 

While attempting to Improve the. repre.senta- 
tion of the longest cDNAs in our libraries, we 
developed three methods for construction of nor- 
malized libraries, in addition to the procedure 
that we described previously (Soares et al. 1994:), 
and used ihem successfully to generate normal- 
ized cDNA librarles'from human (15), mouse (3), 
rat (2), and Schistosoma mansoni (1) tissues. All 
human and mouse cDNA Ubraries have been con- 
tributed to the Integrated Molecular Analysis of 
Genomes and Their Expression (IMAGE) Consor- 
tium (Lennon et al. 1996), and to dale a total of 
315,408 expressed sequence tags (ESTs) have 
hiccn derived from these libraries (dbEST release 
052396; htt[j://www.ncbl.nlm.nih.gov). 

Mere we presenl a detailed descriplion and a 
comparative analysis of the four jriethods that we 
have de\'eloped to normalize rDNA libraries; we 
describe a simple procedure for tiie consUuctiun 
of subtractive cDNA libraries; and we discuss 



strategies that taVe advantage of subtractive hy- 
bridization to expedite the ongoing IMAGE./ 
Washington UniversiLy/Merck gene discovery 
program. 

RESULTS 

While attempting to improve the representation 
of the longest cDNAs in our rionnalized lihraiies, 
we developed four methods and constuicted over 
35 libraries, most of which are described here. A 
list comprising 15 human, three mouse, two rat, 
and one scliistosomc library with Iheir respective 
names, number of recombinants, sequence tags, 
and racthod.s used for normalization and prepa- 
ration of single-stranded plasmids is shown in 
Table 1. 

Extensive characterization of two norm.ilizcd 
libraries [nonnalizeU infant braui (INIB) and nor- 
malized fetal spleen (INFLS)] constructed accord- 
ing to our previously described procedure (Soatcs 
et al. 1994; here designated as method 1) con- 
firmed our original obsetvaUorxs that a great ex- 
tent of normalization can be achieved (^ith this 
method for mo.st cDKIA species (e.g., cf. lanes 
9,10 in Fig. IM-P). It is noteworthy that the fre- 
quency of cDNA 122 (used as the probe in P) was 
increased with normalization from <0.0006% In 
the starting library to 0.007% In the INIB library 
(Soares et al. 1994). However, Soutliern hj'brid- 
ization of starting and normalized libraries with a 
battery of cDNA probes revealed that on occasion 
truncated clones were favored over tlieir longest 
counterparts during the process. This was first ob- 
served when Southern blots of Notl -i- H/iidlll- 
dlgested plasmld DNA from starting and normal- 
ized Infant brain libraries were hybridized with a 
cDNA probe for mitochondrial 16S rRNA (see Tig 
IL, lanes 9,10). Not only was the frequency of 
these mitochondrial cDNA clones not reduced ef- 
fectively during the process of normalization (fre- 
quency of ocrurrenre In starting and noimalized 
infant brahi libraries was 1,4% and 1.0%, respec- 
tively), but also the length of the hybridizing cO- 
NAs was noticeably smaller In the normalized li- 
brary; Comparative sequence analysis (not 
shown) of a number of hybridizing mitochon- 
drial 16S rRNA clones from both starting and nor- 
mall7.etl libraries revealed that whereas the 3' end 
of most cDNAs derived from the starting library 
corresponded to the bona fide 3' end of the lOS 
rRNA, the 3' end of the majority of the cDNAs 
Isolated from the normalized library corre- 
sponded to sequences further upstream on the 
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Table 1. Complete Ust and Main Features of the Normalized Human, Mouse, Rat, and 
Schistosome cDNA Ubraries 



recombinants Preparation 
in the of single- 
normalized stranded 
librafy plasmids 



Human Infant brain" 
Humen fetal liver spleen' 



Human term placenta 
Human 8-9W placenta 
Human breast^ 
Human adult brain' 
Human retina" 
Human pineal gland' 
Human ovaiy tumoi* 
Human mftlanocytes'" 
Human fetal heart 

Human parathyroid 

Human sencsrent figrobiast" 
Human multiple sclerosis plaques" 
Human fetal king' 
19.S-dpc mouse embryo" 
1 7.$-dpc mouse embryo" 
U.S- to 143-dpc mouse embryos" 
Rat heart" 
Rat kidney^ 

8-week-old adult schutosome' 



Nb2HFLS20W(1 NFLS) 

5Nb2HFLS20W 

6Nb2HFLS20W 

14Ni52HFi^Z0W 

15Nb2HFLS20W 

Nb2HP 

2NbHP8-9W 

2NbHbsi-3NbHBsi* 

N2b4HBS5Y-N2ta5HBS5Y9 

2N2b4HR-N2biHH 

3NbHPC 

NbHOT 

2NbHM 

NbHHlSVS' 

NbHPA 

NbHSF 

2NbHM$P 

NbHIiaw 

P3NMF19.5 

NljMEl 7.5 

Nbk/IKH-VM.S 

NbRH 

2NbRK 

NbSSW 



2,500,000 
19,000.000 
3,200,000 
1,400,000 
3,200,000 
35,000,000 
750,000 
100,000 
2,090,000 
3,170,000 
1,600,000 
1,000,000 
1,100,000 
6,800.000 
9,700,000 
3.400,000 
9,900,000 
1,100,000 
21,700,000 
3.400,000 
6,SOO,000 
380,000 
400,000 
130,000 
1,000,000 



In Jtro 
in vitro 



ACAAC 
GACAC 
CGAAA 
ACAAC 



raries w«r« ewistrurted in thf pTm-Poc ve 



"-.:r^rrait'=;s<r™'Sa~4^^^ 

Columbia University. New York. NY)*va5 from a 72-day-old twnalc wno 

en«e of spinal mujcular aMphy. 
fetal liver spieen Odndly provlcled by Dr. SlepI 



„ ...i rare restriction enzyme (NoU or 
nn parathyroid adenoma, senescent fibre 
' - --rain (Itindly provided by Or. Con 



an Medical Center. New Vnrk. NY) v 
n reducnon mammoplaity tissue wes Idndly provided by Dr. 



postconcepUon nofmal leimle. 

^rotal cellular poly(Ar nRNA tram nonnal breast pooled 

he,rph1;^:^«lj«l whUe n,alT;,, basal 9»"3na. ' " 

»N2b4HB5SY and 2IvUb4HR differ from N2bSHBSSY and N2b$HR, re^jettlveiy, in Uie <verai|esue 

— "-^Stor nomial h«na,. reiina RNA Cklivily provided by Dr. Roderick R. Mdnnes, University of Toronto 



eircDNA ir 



.-ti 0-5-2.5 



I 0 5S -year-old Caucasian male. 
Td fkindly provkled by Dr. DavW Klein, NaUonal 
. , » 'r.. . gij,^ (ginnd 1 : 4ltyei 
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Figure 1 Comparative analysis of storting and normal- 
ized cDNA libraries by Southern hybridization with 14 
cDNA probes. The 0.01 5 fig Pod + EcoRI digcrtcd plas- 
mid DNA from the starting fetal liver/spleen library (lane 
6), from the normalized fetal liver/spleen libraries con- 
structed according to method 2-1 (lane 1), method 2-3 
(lane Z). method 2-2 (lane 3), method 1 Oane 4), method 
4 (lane 5), and from the liver/spleen mini-libraries en- 
riched for abundant cDNAiS (HAP-bound fractions) gener- 
ated with method 2-1 (lane 7) and method 4 (lane 8) 
were electrophoresed on 1 % agarose gels, transferred to 
nylon membranes (GeneScreenPlus; DuPont/NEN) and 
hybridized at 42'C In 50% formamide, 5 x Denhardt's 
solution, 0.75 m NaCI, 0.15 m Tris (pH 7.5), 0.1 m sodium 
phospiiate, 0.1% sodium pyrophosphate, 2% SDS con- 
taining sheared and denatured salmon sperm DNA at 1 00 
)xg/ml. Similarly, 0.05 ti-g Not\ + HindW digested plasmid 
DNA from the starting (IB; lane 9) and normalized (1 NIB; 
larie 10; method 1) Infant brain libraries (Soares ct al. 
1 994) were electrophoresed, transferred, and hybridized 
as described above. Radioactive probes were prepared by 
random primed synthesis using the Prime-It 11 kit (Strata- 
gene). The following probes were used: a-globin (A), „hr..,^h^ 

B-globin (B), -y-globin (Q, serum albumin (O, shorter exposure, £, longer exposure), acidtc nbosomal pho.pho- 
protein PO (0, HI 9 RNA (C, shorter exposure; H. longer exposure), apolipoprotein A (/), angiotensinogen (^, 
unlcnown cDNA 8 {K), mitochondrial 1 6S rRNA (L). a-TubuIln (M), myelin basic protein (N), ^^J' 
and unknown cDNA 1 22 (P). All probes were contaminated intentionally witii a small arnount of vector DNA to 
enable visualizaOon of vector bands and thus confirm that a similar amount of library DNA was loaded in all lanes. 
(V) vector band, which is released from the cONA inserts by double digestion with the restnaion enzyrmes 
specified above. 




16S rRNA. me occurrence of such 3' truncations 
was also documented by sequence analysis (not 
shown) for serum albumin cD>3As in the fetal 
liver/spleen library (see Fig. 1D,E, lanes 4,6). 

Reasoning that this problem could be cir- 
cumvented If the fmgincnts used In the hybrid- 
ization with the single-stranded circles (1) were 
in excess, and (2) spanned the entire length of 
the cDNAs, we developed an alternative proce- 
dure to normalize cDNA libraries based on liy- 
bridization of in vitro synthesized RNA (driver) 
from an entire library with the library itself in the 
form of .single-stranded circles (tracer) (see. meth- 
ods 2-1 and 2-2 in Fig. 2). Several i)oniiali;£ed li- 
braries were generated by this procedure (see 
■1-ablc 1). 

Southern hybridization of endonutlease- 
i?stiicted plasmid DNA from starting and nor- 
inalized Ubraries with a number of cDNA probes 
(Fig. 1) Indicated clearly that these methods ef- 
fectively improved tlie lepreseiitation of tlie 
longest cDNAs in the nomialized libraries (e.g., 
cf. lanes 1,4 in Fig. 1A,D,K,C,H). However, charac- 
terization of one of these libraries (SNb2HFLS20W) 



by colony hybridization with cDNA probes (not 
shown) indicated that this approadh was effective 
to reduce the frequency of .some, but not all, of 
the most abundant clones (e.g., serum albumin 
was reduced about 20-fold, whereas -y-globin was 
rcduccxl only twofold). No difference was ob- 
served when hybridizations were performed at 
different conditions [0.4 m NaCI and 50% for- 
mamide at 42°C as in methods 2-1 and 2-3; 0.12 
M NaCI, 50% formamide, and 1% sodium dodecyl 
sulfate (SDS) at 30°C as Ui method 2-2 (see lane 3 
in Pig. 1); 0.4 m NaCI and 80% formamide at 
42°C, not shown]. 

It is noteworthy that Northern hybridization 
(not shown) of in vitro transcribed RNA synthe- 
sized from an entire plasmid library with prohas 
derived from the abundant cDNAs tliat failed to 
be normalized effectively by this procedure (e.g., 
globins in the fetal liver/spleen library and glyc- 
era1dehyde-3-phosphate dehydrogenasH (G3PD) 
in the breast library) indicated that tliey were not 
as prevalent in the population of in vitro tran- 
scribed RNAs as they werp in their respective 
starting cDNA libraries. 
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Figure 2 Diagram of the normallzatifm 
methods 2-1, 2-2, and 2-3. Double- 
stranded plasmid DNA representing an en- 
tire starting library is (1) linearized wilfi ei- 
ther SffI, Not\, or Pac\ and used as template 
for synthesis of RNA in vitro using Vi or T7 
RNA polymerases, and (2) converted to 
singlR-stranded circles either In vivo, upon 
electroporation into DH5aF' and superin- 
fection with Ml 3K07, or in vitro by the 
combined action of Gene II and Exonucle- 
ase III (Life Technologies). Single-stranded 
plasmid DNA is HAP-purified and hybrid- 
ized (Cot - 5) with excess RNA (prctrcatcd 
with RNase-free DNAse I; Promega), 
blocked with appropriate oligonucleotides 
to prevent hybridizatior* through corrimon 
vector sequences (see Methods section). 
Both the fraction that remains single- 
stranded (flow-through) as well as the re- 
sulting hybrids (bound) are purified by HAP 
chromatography. The HAP flow-through 
fraction is converted to doubfe-stranded 
plasmlds, elearoporated Into DHIOB bac- 
teria (Ufe Technologies), and propagated 
under ampicillin selection to generate an 
amplified normalized library (methods 2-1 
and 2-2, depending on the conditions used 
for hybridization; see Methods section). 
The HAP-bound fraction Is also converted 
similarly to double-stranded plasmids, elec- 
troporated into bacteria, and propagated 
under ampicillin selection to generate a 
mini-library enriched for abundant cDNAs. 
Double-stranded plasmid DNA from this 
mini-library Is linearized and used as tem- 
plate for synthesis of RNA in vitro. After di- 
gestion of the plasmid DNA template with 
ribonudease-free DHAse I (Promega), tlie 
RNA (driver) is blocked with appropriate 
oligonucleotides and hybridized (Cot 
-100-200) with HAP-purified single- 
stranded plasmlds derived from the start-- 
ing library (see above). The remaining 
e purified by HAP chromatography, converted to double-stranded cirdes, etectropor- 
^j^^. . / I , 1 1^ toio/-tiiin to n<inpraf-p an amoliflea nor- 
malized library (method 2-3). 




/ 



' A significantly improved extent ot normal- 
jzatioii was adiieved xvheii runoff RNA synthe- 
sized from the pla.sTnld minl-llbrarj' enriched for 
abuiidaut cDNAs (hydroxyapatite (HAP)-bouTiU 
fraction of method 2-1 In Fig: 2) was hybridized 
(Cot = 100-200) with single-stranded circles from 
the starting library (see method 2-3 In Hg. 2 and 
Table 1; cf. lanes 1,2 bi Vig. 1A-D,F.G). 

In an effort to preserve the positive charac- 



teristics of t>oth methods 1 and 2 (i.e., the ad- 
equate extent of normalization achieved with 
metliod 1, and the improved representation of 
the longest cDNAs achieved with method 2), we 
developed two addUional leassociation kinetics 
based procedures involving DNA-DNA hybrid- 
izaUon (mcthod-s 3 and 4; see Fig. 3). 

Method 3, which was successfidly used to 
construct a normalized library from multiple 
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.sderosis plaques (see 2NbHMSP in Table 1). in- 
volved hybridization of a 20-fold excess of single- 
stranded cDNA fragments (comprising ttie 5' 
halves of all inserts of the starting lUjrary, gener- 
ated by Exonuciease 111 digestion of gel-purified 
double-stranded cDNAs; see Fig. 3) wltli complc- 
TTimtaiy slngle-stianded tirdes produced in viUo 
by the combined action of Gene U and Exonude- 
ase 111 (Life Technologies). 

Southern hybridization of Not\ + EcoRI- 
dlgested plasmlU DNA from tlie starling and uor- 
malized (with methods 2-1 and 3) multiple scle- 
rosis plaques library with mitochondrial 16S 
rRN.\ and myelin basic protein cDNA probes (not 
shown) clearly Indicated that method 3 was su- 
perior to method 2-1 in that a much greater ex- 
ten t of normalization was achieved, at the same 
time that it maintained (siinllar to method 2-1) 
appropriate representation of the longest cDNAs 
in both cases. 

For the libraries consLrucLed with method 4 
(see Table 1 and Hg. 3), double-stranded cDNA 
inserts generated by the polymerase chain reac- 
tion (PGR) with T3 and T7 primers were indted 
and hybridized (in the presence of vast excess of 
blocking oligonucleotides) with single-stranded 
plasmid library DNA prepared In vitro. 

Southern hybridization of Pad + EcoRl- 
digested plasmid DNA from starting and normal- 
ized (with methods 1, 2-1-2-3, and 4) fetal liver/ 
spleeii libraries (Fig. 1) with several cDNA probes 
(induding ihose Uial revealed incomplete nor- 
malization with methods 2-1-2-3, such as a-glo- 
bln, p-globln and 7-globln) demonstrated t)ie ef- 
ficacy of method 4 in achieving the desired ex- 
tent of normalization obtained with method 1 
(cf. lanes 1-6 in Fig. 1 A-D, F-H, and lane.<! 3-6 In 
Pig. ll-K) while preserving the representaUon of 
the longest cDNAs (e.g., the longest albumin 
cDNA was present in the normalized library pre- 
pared with method 4, shown in lane 5 of Fig. 
1D,E, but it was undetectable in the normalized 
library constn.icted with method 1, shown in 
lane -4; a .similarly remarkable difference was re- 
vealed with the cDNA probe for H19 RNA, shown 
in Fig. 1G,H). Characterization of the normalized 
library generated with method 4 by colony hy- 
bridization with 10 cDNA probes (not shown), 
which occur at a wide range of frequencies in the 
.starting library, confirmed the effectiveness of 
the procedure to narrow their frequencies down 
to within one order of magnitude (e.g., tlie fre- 
quencies of tiie cDNAs for -y-globin, «-globin, 
p-globin, HIO RNA, and transferrin were reduced 




Figure 3 Diagram of the normalization methods 
3 and 4. In method 3 double-stranded plasmid DNA 
from a starting library is digested with restriction 
enzymes Uiut generate 5' protruding ends, and 
the excised cDNA inserts are gel-purified from the 
doning vector and digested with Exonucleasc III 
to yield noncomplementary singie-stranded frag- 
ments, each representing half of a cDNA insert. 
Note that the single-stranded fragments that 
span the 5' half (but not the 3' half) of the cDNA 
inserts are complernentary lo single-stranded plas- 
mids prepared in vitro. These .single-stranded DNA 
fragments are blocked with appropriate oligo- 
nudeotldes (see Methods) and hybridized with 
single-stranded library DNA prepared in vitro 
{middle column). The remaining single-stranded 
circles are HAP-purified, converted to double- 
stranded plasmlds, electroporated into DH10B 
bacteria (Ufe Technologies), and propagated under 
ampicillin selection to generate a normalized li- 
brary. In method 4, single-stranded library DNA 1$ 
used as template for PCR amplification with T3 
and T7 primers. PCR-amplified cDNAs are purified 
from exceiis primers, melted, and hybridized with 
single-stranded library DNA in the presence of 
blocking oligonudeotides. The remaining single- 
stranded circles are purified by HAP diromatog- 
raphy, converted to double-stranded plasmids, 
electroporated into bacteria, and propagated 
under ampicillin selection to generate a nornialized 
library. 
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from 9.2%, 6.4%, 3.6%. 1 .B%, and <0.2% to 0.04%, 
0.02%, 0.01%, 0.1% and 0.1%, respectively). 

Ill order to assess further the ability of the-se 
noimallzation procedures to preferenlially re- 
duce the representation of the most abundant 
cDN.As, we have performed a comparative .se- 
quence analysis (not shown) of 100 clones picked 
randomly from the fetal iiver/spleen cUNA li- 
brary noTma«7etl with method 4 (14Nb2HilS20W 
ill Table 1; H/VP-flow-through fraction In Fig. 3), 
and from two fetal liver/spleen mini-Ubrarles en- 
riched for abundant cDNAs (HAP-bound frac- 
tions m Figs. 2 and 3) obtained during HAP pu- 
rification of the normalized libraries prepared ac- 
cording to methods 2-1 (5Nb2nFLS20W) and 4 
(14Nb2Hn520W). A number of cONAs known to 
be prevalent in the starting fetal liver/spleeu 11- 
brarv (e.g., albumin, -y-globin, u-globin, p-globin, 
mitochondrial RNAs, and apolipoprotclns A and 
H) were found at Increased firequencies in both 
mini-libraries enriched for abundant cDNAs, but 
none of them was represented in the sample of 
too clones from the normalized library. It Is note- 
worthy that while 47% of Uie sequences derived 
from the normalized library were not represented 
In the "all nonredundant" subdivision of se- 
quences of OenBank + EMBL + DDBJ + PDB, the 
majority of the sequences obtained from the 
mini-libraries of abundant cDNAs derived from 
methods 2-1 and 4 (91.4% and 86.9%, respec- 
tively) did have hoinologous sequences hi that 
data base. Furthermore, although 49% of the se- 
quences derived from the nomializcd Ubrary had 
fewer than 10 homologous ESTs In the dbEST 
svibdivision of GenBank, most of the sequences 
obta'mcd from both mini-libraries had greater 
than 10 homologous ESTs in tlie dbEST database 
(92.5% and 89.7%, respectively, in the HAP- 
bound fractions of methods 2-1 and 4). 

With the ultimate goal of facilitating the on- 
going process of gene discovery by large-scale se- 
quencing of cDNA clones picked randomly from 
libraries, wc have performed a pilot subttactive 
hybridization experiment to eliminate (or reduce 
representation of) a pool of approxunately 5000 
IKfAGE Cjjnsortium-arrayed cDNA clones (pool 
no. 1, LLAM 78-90) from the normalized library 
irom which they were derived (INFLS in Table 1). 
PCR-amplified cDNA inserts from pool no. 1 were 
niPlted and hybridized, in the presence of block- 
ing oligonucleotides, with single-strdnded pla-s- 
mid DNA from the INFLS Ubrary, prepared in 
vitro. The remaining single-stranded circles were 
purified by HAP chromatography, converted to 



double-strarided plasmids, electropotated into 
bacteria and propagated under antibiotic selec- 
tion to generate the subtracted INFLS-Sl library 
(see Fig. 4). Prelimiuary characterization of the 
iNFLS-Sl library by Souther n hybridization with 

10 cDNA probes (only five are shown; see Fig. 5) 
known to be repreMJnted hi pool no. 1 indicated 
clearly the effectiveness of the procedure to 
eliminate (or to reduce, the representation of) all 

1 1 cDNA sequences in the INFLS library. A 
BLASTN search of the dbEST division of GenBank 
(6/12/96) with 3' ESTs obtained from the five 
probas (cDNAs -1, -4, -8, -9, and -10) the hybrid- 
izations of which were not shown in Figure 5, 
revealed the presence of 0, 0, 1, 2, and 2 cone- 
spondlng ESTs, respectively, ftom the ItmS li- 
brary, thus Indicating that the subtraction was 
successful even for cUNAs that were under- 
represented in the normalized library (a total of 
44,407 3' ESTs have been derived from the INFLS 
library to date). It should be noted that because of 
sequencing failures, some of the clones in these 
arrays may not yet have corresponding E.ST.>! in 
the public data bases. 

It ts noteworthy that when we attempted to 
perform the same subtrnctive hybridization ex- 
periment using, as driver, RNA synthesized in 
vitro from a plasmid DNA preparation of pool no. 
1, the results obtained were not satisfactory (not 
shown) In that subtraction could be demon- 
strated for some but not all tested clones (e.g, 
a-globin could not be subtracted effectively), 
similar to what we observed in normalizations 
with method 2-1. 

DISCUSSION 

As a result of an effort to improve the represen- 
tation of tlie longest cDNAs In our normalized 
libraries, we have developed four different meth- 
ods for nonnaUzation of directionally cloned 
cDNA libraries constructed in phagemid vectors, 
while conttibviting resources to the FMAGE Con- 
sortium (Lennon et al. 1996) and tiiereby facili- 
tating the ongoing gene discovery and mapping 
programs. Approximately 87.5% of all (human) 
IMAGE ESTs were derived from the normalized 
libraries described here. 

The normalization procedure (method 1) 
that we described previously (Soares et al. 1994) 
was applied for the construction of tlie INIB 
and INFLS normalized libraries, from which a 
total of 45,192 and 86,088 ESTs, respectively, 
have been derived (dbEST release 0S2396; http:// 
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Figure 4 Diagram of the suhtrartive hybrldteatlon 
procedure used to generate the 1NFLS-S1 library. 
Double-stranded plasmid DNA from a pool of 
-5000 IMAGE Consortium-arrayed cDNA clones de- 
rived from the INfLS library (pool no. 1, LLAM 78- 
90) was converted to single-slranded circles in vitro 
by the combined action of Gene II and Exonuclease 
ill (Ufa Technologies). The resulting single-stranded 
plasmids were HAP-purified and used as a template 
for PCR amplification with T3 and T7 primers. PCR- 
amplified cDNA inserts were purified from excess 
primers, melted, and hybridized with single- 
stranded circles (prepared in vitro) from the INFLS 
library, in the presence of appropriate blocking oli- 
gonucleotides. The remaining single-stranded 
circles were purified by HAP chromatography, con- 
verted to double-stranded plasmids, electroporated 
into DHIOB bacteria (Ufe I echnologies), and propa- 
gated under ampicillin selection to generate the 
(1NFLS-S1) subtracted library. 



www.n.cbj.n.Iin.nih.gov). Data analysis (see 
HDller et al., tbU issue) dpmonstrated solidly the 
efficacy of this approacl) in bringing the fre- 
quency of all clones to witliin a narrow range. 
Extensive characterization of these, twn iibrarles 
by Southern analysis, however, revealed that on 




Figure 5 C3iaracterlzatlon of the 1NFLS-S1 sub- 
tracted liver/spleen library by Southern hybridiza- 
tion with 5 cDNA probes. The 0.1 5 ixg Pac\ -i- £coRI- 
digestcd plasmid DNA from the fetal liver/spleen 
library nomialized with method 1 (TNFLS; lane 7)- 
from the pool of -5000 IMAGE Consortium-arrayed 
cDNA clones derived from the 1 NFLS library (pool 
no. 1, LIAM 78-90; lane 2), from the subtracted 
library generated according to the diagram shown 
in Fig, 4 (1NFLS-S1; lane 3), and from the HAP- 
bound fraction obtained during HAP purification of 
the INfLS-SI library (see Fig. 4) were electropho- 
resed, transferred to nylon membranes, and hybrid- 
ized as described In the legend to Fig. 1 . The fol- 
lowing cDMA probes were used: a-globin (A), -y-gb- 
bin (S), serum albumin (Q, unknown cDNA 7 (D; 
pidced randomly from pool no. 1 , LLAM 78-90), and 
unlcnown cDNA 5 (£; picked randomly from pool 
no. 1. LLAM 78-90). A BIASTN search of the dbtST 
subdivision of Ccnbank with 3' ESTs derived from 
cDNA 7 and cDNA 5 revealed the presence of 33 
and 0 corresponding ESTs, respectively, from the 
INFLS library. All probes were contaminated inten- 
tionally with a small amount of vector DNA to en- 
able visualization of vector bands and thus confirm 
that a similar amount of library DNA was loaded in 
all lanes, (V) vector band; (U) residual undigested 
plasmid. 



occasion tmncatcd clones were favored over their 
longest counterparts during the normalization 

Because of the relatively permissive, condi- 
tions used for synthesis of first-strdnd cDNA, 
priming with the Nfrfl-tag-(d l )is oligonucleotide 
may occtir not only at the poly(A) tail of the mR- 
NAs but alsr> at Intp.rnal A-rich sites within the 
mRNAs (e.g., at Alu tails). Typically, cDNAs with 
3' truncations occur at frequencies of 10-15% in 
directionally cloned libraries. Truncated clones 
can be recognized (tentatively) as such, by the 
absence of a bona fide polyadenylation signal 
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sequence at the appropriate distance upstream 
from the 0lig0(dA)iK tat) of the cDKA. 

Why may truncated cDNAs be favored over 
their longest counterparts during normaH?atton 
by method 1? Briefly, method 1 (Soaxes et al. 
1994) Involves: (1) annealing of a single-stranded 
DNA preparation of a dlrectinii.illy cloned cDNA 
library with an oligo(dl"),8 priiaer; (2) controlled 
primeT ejctension reactions in the presence of de- 
oxynucleotldes and dideoxynucleotides to gener- 
ate 3' noncciding extension products of approxi- 
mately 200-300 nucleotides; (3) purification of 
the resulting partially double-stranded circles by 
HAP chromatography; (4) melting and reassocia- 
tion of the HAP-purified partially double- 
stranded circles to a relatively low Qjt (5-10); (5) 
purification of the remaining single-stranded 
circles (normalized library) over HAP; (6) conver- 
sion of the single-stranded circles to double- 
stranded circles; and (7) clcc:troporatlon into bac- 
teria. 

It could be anticipated that during the leas- 
sociatlon reaction, bt-causc truncated cDNAs oc- 
cur at lower frequencies than their nontruncatt^d 
counterparts, the extension products of the trun- 
cated cDNAs would more likely reanneal to the 
nonltuncated overlapping cDNAs than to their 
own truncated templates. On the other liand, the 
extension products of the nontruucaled cDNAs 
would most likely reassociate to their own non- 
truncated templates not only because they arc 
more prevalent but also because of the low prob- 
ability of there being an overlap between the 
short extension product of a nontnincated clone 
and a truncated single-stranded drcle. As a result, 
nontnincated single-stranded circles are more 
likely to end up re^ssnclated with more than one 
(nonoverlapping) extension product, whereas 
their truncated counterparts would remain 
single-stranded and therefore end up in the MAI> 
tlow-through fraction (normaI!2x;d library). 

Reasoning that this problem toidd be cir- 
cumvented if the hybridizing fragments (1) were 
in excess over single-stranded circles, and (2) 
spanned the entire lengUi of the cDNAs to maxi- 
mize the opportunity of overlap between trun- 
cated and nontnmcated clones, we devised an 
approach (melhods 2- 1 and 2-2; note that is 
the same as 2-1 except that hybridization condi- 
tions were different) whereby in vitro synlhe- 
i'netl RNA from a plasmid DNA preparation of a 
starting library is used as driver In hybridization 
(Col - 5) with the same library in the form of 
single-stranded circles. Indeed, these modifica- 



rions improved successfully the representation of 
the longest cDNAs In the normalized libraries 
(e.g., serum albumin in tlie Ilver/spleen libraries). 

However, in every library constructed with 
methods 2-1 and 2-2, wc were able to identify 
cDNA clones that seemed to become normalized 
with much greater difficulty than oUiers (e.g., 
ir-globin in the 5Nb2HFLS20W liver/spleen li- 
brary, and C.^PD in the breast library). We liiter- 
preted these results as suggestive that not all 
clones might be transcribed in vitro with tlie 
same efficiency if in a mixture (i.e., in vitro tran- 
scription of piasmid ONA from an entire library), 
and/or secondary structures in tire RNAs (or In- 
tcraction.s between RNAs) might impair their 
ability to hybridize with the .single-stranded 
circles. Tliese hypotheses were corroborated by 
the observation (not shown) that relatively weak 
hybridization signals were observed wlicn North- 
ern blots of RNA transcribed in vitro from an en- 
tire plasmid library were hybridized with cDNA 
probes derived from th<isc clones that could not 
be normalized as effectively, despite the fact that 
they occurred at high frequencies in the slarling 
libraries from which the in vitro transcribed 
RNAs were synthesized. We did exclude the pos- 
sibility that the clones Uiat were not being nor- 
malized effectively carried deletions that pie- 
vented them from Ix^ing transcribed appropri- 
ately in vitio (not shown). In fact, all clones tliat 
were tested individually for in vitro transcription 
yielded the expected amounts of full-length RNA. 
Although this problem was significantly mliii- 
mized in method Z-3 (cf. lanes, 1,2 in Fig. lA- 
D,F,C) the extent of normalization that was 
achieved was still not comparable to that ob- 
tained with method 1 (cf. lanes 2,4 in Fig. lA- 
D,r,H). 

The advantage, of method 2-3 over methods 
2-1 and 2-2 is that the RNA driver is derived from 
a mini-library (of relatively low complexity) en- 
riched for abundant cDNAs rather than from the 
entire starting library. For this reason, higher C^t 
hybridizations can be carried out to eliminate or 
reduce significantly the representatiort of the 
most abundant cDNAs. It should be noted, how- 
ever, that method 2-3 is not a true normalization 
procedure, because the aim of this approach is 
not to equalize the frequency of all cDNA clones 
but rather to reduce significantly (or even to 
eliminate, depending on the C^t used) the repre- 
sentation of the most abundant clones. 

The extent to which the enrichment for 
abundant transcripts can be achieved in such 
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mini-libraries depends essentiaUy on the C^t used 
forreassoclatlon. Calculations based on eslimates 
of frequencies of brain mUNAs (Scares et al. 1 994) 
indicate that the best enrichments are obtained 
at a Qt- 5 10. If the Q,t Is too low the 
enrichment is only for Uie mosl prevalent (class 
1) mRMAs; there is no enrichment for the mRNAs 
of the intermediate frcq\icnc>' class (class II) mR- 
NAs. On the other hand, if the Qjt is too '^'gh 
(fcSO) the enricfiment tor class 1 transcripts starts 
to become less significant because of a higher rep- 
resentation of mRNAs of the complex class (class 
111). Prevalent and intermediate (classes 1 -i- 11) 
brain inRNAs comprise 93-95% of the total 
cDNA population in a Qt =■ 5-10 HAP-bound 
mini-library, in contrast to 62% in the starting 
librarv. Comequently, the frequency of class III 
iransrrlpts in a Cnt=S-10 HAP-bound mini- 
library is about 5.5-fold lower than that of Uie 
starting library (5-7% in the bound mini-library 
vs. 38% in tlic starting library). 

Methods 3 and 4 were developed as a result 
of an attempt to achieve both the adequate ex- 
tent of normalization obtained with method 1 
and the Improved representation of the longest 
cDNAs accomplished with methods 2-1, 2-2, and 
2-3. Although more technically cumbersome, 
method 3 is superior to method 4 In that the 
DNA driver used in the hybridization is single- 
stranded. 

Single-stranded driver in method 3 (.see Fig. 
.3) Is generated by Exonuclease III digestion of 
gel-purilied double-stranded cDNA inserts ex- 
cised from the starting library. The resulting non- 
complementary single-stcanded fragments repre- 
sent the 5' and 3' halves of the original cDNA 
inserts. The fragments that correspond to the S' 
halves of the cDNAs are complementary to 
single-stranded urdes prepared in vitro, whereas 
the single-stranded fragments that correspond to 
the 3' halves of the cDNA inserts are complemen- 
tary to single-stranded plasmlds prepared in vivo. 
Note lhat for the miilUple sclerosis plaques li- 
brary constructed with method 3 we used single- 
strandcd circles prc'|)ared in vitro. 

Production of single-sltanded circles in vitro 
by die combined action of Ciene U and Exonucle- 
ase III (Life Technologies), rather than In vivo by 
superinfection of a culture with a helper phage, is 
very Ixiiieficlal because il circumvents the distor- 
tions that othenvise may arise as a result of the 
differential growth properties of clones with dif- 
ferent si^e inserts. However, because the diges 
Uon with OJene II results in the conversion of 



most, but not all, supercolled plasmlds to relaxed 
circles, it becomes necessary to purify the single- 
stranded circles that are produced after digestion 
with F.xonudcase. Ill by HAP chromatography. 

For construction of the normalized multiple 
sclerosis plaques library, the cDNA Inserts were 
excised by double digestion of plnsmid DNA from 
the starting library with Notl and EcdRl. The fact 
that one in every three clones might have an in- 
ternal EcoVl site (an Eco RI site is expected to oc- 
cur once every 4096 hp, and the average insert 
size in these libraries is of the order of 1.4 kb) 
should not compromise the efficiency of the pro- 
cedure, because at least one of the re.sultlng re- 
striction fragments would be expected to be 
2:200 bp (clones smaller than 400 bp are size- 
selected out of these libraries) and therefore be 
able to form hybrids that would bind quanULa- 
tively to HAP under our conditions. A disadvan- 
tage of method 3, as presented, is that only 
clones <2.9 kb (approximate vector size) can l>e 
OACised cleanly from the vector. It is conceivable, 
however, Uiat one might be able to use double- 
stranded cDNA fragments generated by PCR am- 
plificadon willi T3 and T7 primers as substrate 
for the E-tonucIease 111 digestion in method 3. 

Method 4 was used to generate a significant 
fraction of the libraries that were contributed to 
the IMAGE Coiisortiuiu (see Table 1). U is un- 
doubtedly the simplest and overall most advan- 
tageous of all procedures. Because the DNA driver 
is generated by PCS amplification of the starting 
(double-stranded or single-stranded, see below) 
plasmid library with T3 andT7 primers, the tracer 
(.•Jingle-stranded circles) used in this hybridiza- 
tion may be produced in vitro oi in vivo. 

Tlie extent of normalization achieved with 
method 4 was comparable to that obtained with 
method I with the advantage tliat it successful- 
ly preserved the representation of the longest 
cDNAs (cf. lanes 4,5 in Fig. 1). Moreover, method 
4 Is superior to method 1 because il does not 
preclude the clones derived from mRNAs with 
internal NofI sites from being represented In the 
normalized library. Because the starting material 
for the reassociation kinetics reaction in method 
1 is generated by a controlled primer extension 
reaction with an oUgo(dT) n, primer, clones with- 
out an ollgo(dA)iB tail (derived froru inRNAs with 
an internal Notl site) are not represented in the 
final normalized library, although they are not 
necessarily lost (clones without tails end up in 
\he HAP riow-lhrough fracdon during HAP puri- 
fication of the partially double-stranded chdes 
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genciatccl by this primer extension reaction). It 
should also be noted that this problem of method 
1 could be circumvented by the use of an oligo- 
nucleotide complementary to flanking vector se- 
quences [as opijosed to the oiigo(dT)iR] for this 
controlled primer extension reaction. 

The potential biases introduced by PC.R am- 
plification in method 4 are nriniiuized by the fact 
that (1) PCR ampUfication products are used in 
excess In these hybridizations, and (2) the size dis- 
tribution of inserts in these libroiies is relatively 
narrow (rangiiig typically from 0.4 to 2.5 kb). 

The conditions used for hybridization greatly 
influcuced the quality of the resulting normal- 
ized libraries a^nstructed with metiiod 4. Ttiis is 
to a great extent a consequence of the fact tliat 
we are using HAP to purify singlc-strandcd cin^lc.";, 
as opposed to a biotin-avidin capture $ysteui, 
which in our haiids yielded significantly less satis- 
factoi>' results (M.F. Bnnaldo and M.R. Soares, un- 
publ.). Hie best results were obtained when the hy- 
bridization conditions were tlie most similar to the 
HAP conditions. Wc interpreted these re^sults as 
suggestive of the fact that Imperfect hybrids 
formed during hybridization may either not bind 
to HAP and/or may melt once in the IfAl* buffer. 

It Is noteworthy tliat a much superior extent 
of normalization was obtained with method 4 
when single-stranded plasmid UNA prepared in 
vitro, as opposed to double-stranded plasmid 
DNA, was used as template for PCR ampUfication 
(not shown). These results suggest that a fraction 
of the ciouble-stranded plasmids used as template 
for PCR amplification, presumably In the form of 
melted supercoileU DNA, might end up in the 
HAP flow-through fraction (normalized library) 
during purification. 

It is noteworthy that cross-hj'brldizing di- 
verged sequences seem to escape riormalization 
in all of the procedures discussed above. For ex- 
ample, the frequency of Alu repeat-containing 
cUNAs (typically 10% in directioiially cloned 
cDNA libraries) is practically the same in starting 
arid normali7.ed libraries. These results suggest 
t|iat imperfect hybrids either do not bind to 
HAP under our conditions or melt once diluted 
in the (more stringent) HAP buffer. This is advan- 
tageous) not only because It preserves the repre- 
sentation of Alu-coiitainirrg cONAs that might 
correspond to otherwise rare mRNAs, but also, 
and most significant, because it minimizes the 
likelihood that a rare member of a gene family 
might be excluded hrom the final (noimalized 
or subtracted) library as a result of a cross-hy- 



brldizatlon with a more prevalent but diverged 
sequence. 

The use of normalized libraries for large-scale 
gene discovery/EST programs is beneficial be- 
cause it minimizes redundancies while increasing 
the representation of the rarer cDNAs by aboxit 
threefold, on average. However, given the great 
extent of overlap in gene expression among dif- 
ferent tissues, the \ise of normalized libraries 
alone is not sufficient to maintain a desirable 
pace of identification of novel sequences at ad- 
vanced stages of such programs. For tliis reason, 
we propose that the use of subtracted libraries 
enriched for clones not yet identified might be- 
come Increasingly advantageous. Towtiid this 
goal, we liave developed a subtractive hybridiza- 
tion approach designed specifically for this pur- 
pose (see Fig. 4). In a pilot experiment, we were 
able to reduce significantly the representation of 
-StXiO INrUS-IMAGE Consortium clones from 
the INFF-S library Itself (see Fig. 5). With the de- 
velopment of appropriate clustering algoritluns, 
the use of nonredtindant sets of cDNA/gene se- 
quences as drivers for hybridizations to generate 
subtrac-tive libraries enriched for novel sequences 
should soon become possible, and hopefully will 
facilitate the isolation of all human and mouse 
cDNAs still awaiting identification. 



Po!y(A)* RNA was puriQed from total cdJular RNA (except 
for senescent tibtoblasts from which cytoplasmic RNA was 
isolated) using tl\e Oiigotex niRNA Icit (Qiagcn) according 
to the manufacturer's InstrucUons, except tliat two founds 
of ptirifiCELtion were petformctl. cDNA librar>' cxinstnictioii 
was essentially as described before (Adams ct al. T993b; 
Soares 1991). Typically, 1 ngpoly(A)' RNA was annealed at 
37"C with a twofold mass excess of a Noa-ta(;-rdT),„ 
tniiuw [or P«(.l-tas-(dT),H hi tlia case of the llvtr/splwii 
liljtjryl and rever.se transcribed at ^7X: with .Superscript 
Reverse Traiisciiplase (Life Teclniologies). Alternatively 
poly(A)* I^NA was annealed at 4S"C witli a fourfold mass 
excess of a N«(l-ta(;-{dT),3 prlmet and reverse liaiiscribed 
nt 4:i"C Tlie tag is a sequence of 2-6 nucleotides that is 
imiqite for each library and thus serves as an identifier (see 
Table 1). WiH> the exception of infant brain, lelal liver/ 
spleen and teini placenta, all other first-strand cHNA 
syntheses were primed with the following oligomiclco- 
tldc; TCTTACCfVArCTGAAGTG GGAGCGGCCCiC-tag- 
(<iT),8 „ IS- The oiigoiiuclcoUde AACTGGAAGAATTCGC- 
C.GCCGCAGG,\i\(d-n,s (Pharmacia) was used to prime 
both Infant brain and tenn pbcenta first-strand cL^NA syn- 
the-sas. The alljjonucleotlae AACTGGAAGA^TTMTTAA^ 
GATCT(dT)m was used to prime the synthesis of flrst- 
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suand fetal Uvei/5pleen cDNA. Diiul>Uvstranded cDNAs 
were size-selectcd by gel filtration over a long (64-cm) and 
iiaitow (0.2-cia diauietei) Bio-Gtt A-50m (Bio-Rad, 100- 
200 mejh) coUniiri, and ligaHjd t<J 1 500- ti> 1000-Iold iiio- 
lai excess of adaplera. Infant brain cDNAS were Ugated to 
//i/idlll adapteirs, digested with Noll, size selected over a 
second Bio-Gel column, and cloned dIcccUonally Into tlie 
Hon and HindM .tltes of tJie Lafmid BA vectcir (Sonn^ et al. 
1994). Fetal liver/spleen cDNAs were Mgateil to FcnM 
adapters (Pharmacia), size-selected as above, digested \slth 
Fart and cloned dirccllonally Into the P<7Cl and £a)lU sites 
of the p Tm-J'ac vector. Alt other cDNAs were llgatcd Ui 
£coRI adapters (Pharmacia), size-selected as above, digested 
with NotI and cloned directloaally Into the Noll and £«oW 
sites of the pT7T3-Pac vector. pT7T3-Pac Is essentially the 
same as pT7T3iaD (Pharmacia) wilh a modified 
(Wlylinlcer. Figure 6 shows llie sequence of tlitr vVn'i-l'm 
polylinker and ftanltinx secjuenccs. 



Production of Purified Cxjvalently Closed 
Single-itrandcd Library DMA in Vitro 

Double-stranded phagemid DMA was converted to sJngle- 
sttanded circles by tlie iMrnblned action of Gene II (phage 
FI endonudease) and Exherichiii coK Exutiuckase 1)1 en- 
zymes, as per (lie manufacturer's Instructions (Life Tecli- 
nolo^ies; cat. no. 103S6-020). The resulting single- 
stranded circular DNA was purified from the lemaining 
double-stranded plasmids by HAP chiomatojjraphy (Uio- 
Rad) as rlescrilied prpvtoiisly (Soarc-s ct al. 1994). Tlie rep- 
lication initiator protein of isacteriopiiage fl (Uene 11) is a 
.slte-specJfic endonticlease tfiat- hinds to the fl origin in 
phagemid vectors and nicks the viral strand of the supcr- 
coiied DNA . The nicked strand is then digested from its A' 
end wltli Exonticlease 111 (Hohclscl 1993) to generate 
single-stranded circles. IHirlflcatlon of the resulting single- 
stranded drdes over HAP Is necessary because the conver- 
sion of supercoiled to related plasmids by Oene 11 is never 
complete, 'ttte Gene II reaction was performed for 1 hr al 
%CrC and contained typically 4 (ig suijercoiled plasmid li- 
brary DNA, 1 |J Gene II (Life Technologies), and 2 pJ 
lOx CJene II buffer (Ufe Technologies) in a total volume of 
20 111. The «ene 11 prt>tpin was heal innrtivated for 5 min 
at (iS'T^; ( he re-iction mixture was chUled on ice; 2 jil Exo- 
nudease 111 {Ufe Technologies, Cat. No. IROKVOli, SS 
units/piD wasaddcd: and the rcactioil was Incubated for 30 
mlt> at 37"C. Gene II and- Exonuclease ni were then di- 
gested vAVa Proteinase K (Boehrlnger Mannheim) for 15 
mln at 50*C in a 100-iU reaction containing 10 inM Tris 
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Figure 6 Sequence of the pT7T3-/>ac polylinker (uppercase) and tlanl<lng 
sequences (lowercase). 



(pH 7,8), 5 mM ethylenediamine tewaacetlc add (EDTA), 
0.5% SDS. and 136 vtg Protclna.<;e K. After extraction wllli 
equal volume of phenol-chlorofoun-lsoaiuyl alcohol <25: 
24:1), library DNA was etlwiiol-preci|jiliited and digested 
with f'wiU for 2 hr at 37*C Tills was done to convert the 
remaining supercoiled plasmids into linear DNA mol- 
ecules and ther«ti>' improve their blndability to IIAP tinder 
OTir coiidiiions. Notr that HviiM does not cleave single- 
stranded circles and tlnai there are two PvitW sites in the 
vector. I he reaction was diUiteil with 2 ml loiidtng Ijuffcr 
[0.12 M sodium phosphate buffer (pH6.8), 10 mM EDTA, 
and 1% SDS] and purified by HAP diromatography at 
60"C, using a c-oUirati pro-equlUbrated with the same 
buffer {1 ml bed vol.; 0-4 g of MAP). Aftera 6-ml wash with 
loading buffer, this volume was combined with the flow- 
ihrough fraction, and tlie satuple was exlracl«l twice wUh 
water-saturated 2-)Hitani)l, once with dry 2-bulanoI, and 
once with water-saturated ether (3 vok. per extraction). 
Residual ether ivas blown off by vacuum and the sample 
vi»s desalted by passage through a Nensrtcb column (D41- 
IVmc/Nl^N) according to the manufacturer'.'! specifications, 
concentrated down to -tl.S.S ml and cthanol-precipitated. 
Note that Gene !l-F,xonuclca.sc III prepared single- 
stranded DNA IS in the opposite polarity to stngle- stranded 
DNA generated by In vivo phagemid production. 



Production of Purified Covalontly Closed 
Single-stranded Library DNA in Vivo 
Plasmid DNA from the starling library was electroporated 
into £. DHSaF' bacteria, and Ihe cultuie v/as grown 
under ampiciUin .«!ection at 37=0 to an OD^n of 0.2, su- 
perinfected with a 10- to 20-fold excess of the helper phage 
M13K07 (Pharmjicia), and harvested after 4 hr for prepa- 
ration of single-stranded plasmids, as described (Vieira and 
Messing 1987). 



Conversion of Single-stranded Qrcles to 
Double-stranded Plasmids 

Single-stranded circles (<5n ng) were ethanol-precipitated 
and re-suspended in 1 1 \l\ vjater. Then 4 (il Sx Sequenase 
buffer (USB) and 1 (il primer (1 were added and the 
mixture was incubated at 65"<: for 5 mln and then at 37*C 
for 3 mln. Then 1 iil Sequenase version 2.0 (USB), 1 uJ O.l 
M dlihlothreltol (DTD, and 2 (J mixed dNIP stc<:k (a so- 
lution containing each deosynucleotide at a fhial concen- 
tration (jf 10 nj.M) were added, and the reaction was Incu- 
bated al i7~C tor 30 mln. The total 
volume was taken up to 100 k.1 with 
10 n'lM Trls (pH8.0) and 1 mw EDTA 
(TE) and the reaction was extracted 
once with phenol-chloroform- 
isoamyl alcohol (2S:24:1). Plasmid 
DMA was ethanol-precipitated and 
dissolved in 1 pJ I E. The following 
oligonudcotidcs were used for this 
primer extaision reaction: (1) Mii 
Reverse Sequencing Primer (5'- 
AGCCGATAACAATTTCACA- 
CACGA-S"), whldi Is complementary 
to single-stranded prepared in vitro, 
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and (2) Olifio-Aiiip (S'-GACTCGrGACTAClCAAC- 
CAXGTO-'S'), whicli is coiaplenieiitarj' to ihc anipictUIn re- 
sutaiiue gene of slnKle-jtraiKln.) pT7T3-Pac or Lafmid BA 
pU.tn)id5 prepared In vivo. 



Ln Vitro Synthesis of Library RNA 

Some 2-5 (ifi <rf double-itrandetl plasraid DNA from eithf r 
Ihe .ttarting lihrary (see melliodj 2-1 and 2-2 Ijelcjw) (ir i>ie 
mini-Ubrary of abiitKtartt cDNAs (see method 2-3 below) 
was linearized with either Pad (NliB) or Noll (NEB) and 
used as a template for synfhRsis of KNA with KilioK/lax 
Large Scale KNA Production Systenu T7 or 13 (I'tomega), 
according to the manufacturer's Instructions. After treat- 
ment with rlbonudca&c-frec DNAsc I (Proincga), to digest 
away the plasmld DNA tempiate, the RNA was used for 
hybrid i/a.tii>ii as dcscrihcd below. It should be noted that 
RNA synthesized Willi T7 RNA Polymerase Is In the mes- 
sageJike orlenUlion and is Loraplementary to tlie slngle- 
strandi'd circles produced hi vitro. On the other band, 
UNA synthesized tvitll RNA Polymerase is in the aii- 
timMSHge orientation and it is complementary lo sluxle- 
strandcd circles produced in vivn. 



Nomwiization Method 1 

rhf procpdnte used for construction of tlie nornialtzcd 
human infant brain (INIB) library (here designated as 
method 1) has been described pres'iously (S<iiirrs et al. 
199'1). Method 1, with minor modifications, was also ap- 
plied to construct the normalized human fetal liver/spleen 
cDNA library (INFLS). To synthesize a partial second 
strand of about 200 nt by limited extension, a 100 iil rc- 
acliod nilxture containing S (J O.S ii.g/iU Pw^ll-dlgcstcd, 
HAP- and gtH-puilfied sin^c-slrandcd plasmld DNA from 
the (elal liver/spleen starting library, 7 10 ng/|Ll ollgo 
(«IT),j,,, (PlMrmada), 10 p.1 lOxPxliuer Extension Biilfei 
(0.3 M Tris (pH 7.5), 0.5 M NaCl, and 0.15 M MgCl^), 10 n.1 
0.1 M rnr, 10 111 mixed dNTP stock, 25 (il ndxed ddNTP 
stock (a solution containing each dideoxy A. C, and G at a 
final concentration of 25 mM), 5 jlI 800 Ci/mmole 
|a-"P]dCTP, and 20.5 id water wa.s inciihaled nt fifTC for 5 
mln, at SO"C for IS mJn, and at 37'C. for 2 min. men 7.5 
liJ 5 unlts/MJ Klcnow enzyme (USB) was added, and the 
reaction was hicubated at 37'C for 30 roln. Tbe reaction 
was extracted with phenol-chlotofonn-lsoamjrt alcohol 
(25:24:1), 5 m-b melted and sheared salmon sperm DNA was 
added, and the partially double-stranded plasmlds were 
purified from the teniaiiiiiig single-stranded circles (un- 
primetl mulec\des, as well as clones derived from mENAs 
wiflVan internal Fiii:'. site that therefore do not contain an 
oiigo(dA) tail al the 3" end) by MAP throinatogra|jhy. Tlie 
H'AP-boimd fraction containing the partially double- 
.straiided plasmids was eUited with 6 ml 0.4 m sodium 
piiosphatc.buffcr{pH 6.8), 10 m M P-DfA. and 1%SDS, and 
plasmid DNA was desalted as described before (Soares et al. 
1994) and cthanol-ptcdpitated. The DNA (17.1 ng) was 
resuspended in 2.5 ^1 deionizcd formamlde. and melted at 
80^ for 3 mln under 10 pJ mineral oil. llien I (il tif S 
>6/ijJ oUgo(dl'),3.,K (used to block the talis) was added, 
and the mixture was heatt>d at 80°C for 1 mln. Then 0.5 |il 
5 VI NaCl, G.5 |i.l lOx TE, and O.S pJ water were added, and 
the reassoclatlon reaction was Incubated al 42*C for 0.6 hr 



(calculated IV = <> S)- The remaining single-stranded 
circles were purified over HAP (flow-through fraction) and 
subjected subsequently to a second cycle of the nonnai- 
Izatlon procedure as described above, except that reasso- 
clatlon was conducted fur 24 hr (calculated Q,* Tl'^ 
remaining single-stranded circles (noritiallzed library; 
INFLS) wexv purified over HAP, converted to double- 
stranded jJiismids, cleclroporalcd into CHIOB bacteria, 
and propagated under ampicillin selection. 



Nonrallzatlon Methods 24, 2-2, and 2-3 

Method 2 is a reassociation Viiiictics-lxtscd approach in- 
volving hybridization of ui vitro synthesized RNA (the 
driver) derived either from the entire library (methods 2-1 
and 2-2; see Fig. 2) or from a rnhil-IHirary enriched for 
abundant cDNAs (method 2-3; see Pig. 2), wltli the who)e 
starting library in the form, ol single-slranded circles (the 
tracer). The rejiiaining single-slranded ciidts (iioriiializeO 
llbratj') are purified by HAP chromatography (HAP flow- 
thmiigh fraction), converted to double-stranded plasmids 
for improvement of electmiioration efficiency, elcctroimr- 
ated into DHIOB bacteria (Life Technologies), and propa- 
gated under ampicillin selection. A number of normaliicd 
cDNA libraries were consiruacd with these methods using 
slngle-saanded plasmids prepared both Ir. ^vo and in 
vitro (see Table 1). In all three variants, the driver was first 
pre-annealcd with a pair of oligonucleotides to bl<x:k both 
5' and 3' vector sequences as follows: O.S |i.l (10 m-S) of eadi 
oligonucleotide, 1 jtl RNA (S.O ng in methods 2-1 and 2-3; 
0.5 (Lg in metliod 2-2), and 4.0 ^l deionized (orinamide 
were heated for 3 min al 80°C uiKlei 10 jlI mineral oil and 
quicjjiy chilled on ice. Then 0.8 ixl lOx hybridization 
buffer [0.4 M Hpes (pH 6.4), 4 M NaCi, and 10 mM EDTA in 
methods 2- 1 and 2..1r 0.4 m Viiies (ptl 6.4), 1.2 m NaCl, 10 
m-M EDTA, and 1% SIJS in method Z-2), O.S |U RNAsin 
(Doehringcr Mannnhcim), and 0.7 (d water were added 
and the mixture (total volume, S p-l) was incubated ovcr- 
nlj?ht at 42°C (methods 2-1 and 2-3) or 30°C (method 2-2). 
In another tube, 2.5 iaI (SO ng) single stranded library DNA 
in deionized fbrmamide was heated for 3 mln at 80°C un- 
der mineral oil; O.S 10 X hybridization buffer and 2.0 tU 
water were added; and the mixture was transfened to the 
tube containing the pruanncalcd RNA. Hybridization (13- 
pi reaction) was performed at 42°C (method 2-1: Q,! = 
S-10; method 2-3: Q,t = 100-200) or at 30^ (method 2-2: 
Qt = .S-1 0). The d river, rather than the tracer, was blodicd 
because otlierwise tlie latier would, to some extent, bind to 
HAP during purification. The plasmid mini-librar>' en- 
riched for abundant cDNAs th,it served as a templah- for 
the synthesis of RNA used as driver in method 
2-3 was prepared from the HAP-bound fraction obtained 
during purification of the normalized library In method 
2-1. Different paire of blocking oligonucleotides were used, 
depeiiduig on whether the RNA was synthesized wltli 
T3 or T7 RNA polymerases. To blot* KNA synthesized with 
T3 RNA polymerase, which was used in hybridizations 
Willi single-stianded plasmlds prepared in vivo we used: 
5'-,,ACj-GGCGGCCGCAAGCTTATTCCCTTTACT- 
CAGGO-n AAr-3' (this oligonucleotide was used to blocic 
S' vector sequences of all but the huirian fetal lixer/spleen 
library RNA), and S'-,,AGATCTTTAATTAAGCGGCCG- 
CAACCn A ITCfXTl l rAGTG.^GUU^^AA■L■-3' (this oligo- 
nucleotide was used to block 5' vector sequences of the 
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Human fetai livei/splccri library ilNA), and S'-AGG- 
CCAAGAATrCGGCACGAG-3' (,thin oligonucleotide was 
usutl ti> block 3' vector setpicnces). To block RNA synthe- 
si2ed ivlth'r? RNA polyitip.rase, which was used )n hybrld- 
izatkins with singlK-stranded plosmtds prepared In viuo 
we used: S'-CCTCGTCCCGAATTCTIGGCCTCGAG- 
GGCCAAATTCCC-3' (this oUgomideotlOe was used to 
block S' vector sequences). The ollyonucleotiJc used to 
prime Ihc synthesis ot flist-sttaiid cDNA was also us«l to 
block 3' v( 



Normalization Method 3 

Method 3^ used to genetate rtie noanalized libiary Ironi 
multiple JCletOSlS plaques {2NtHMSl'), is a reassociafioii- 
kinetlcs based jpproadi involving hybridiratinn tQ,t = 
20-25) of H 20-fold excess of Exomiclense Ill-digested 
CONA Inserts excised from a plasmld DNA pteparatlon of 
the stalling library with the library itself in the form of 
single-stranded circles, followerl by HAP-purlflcatlon of 
ttie remaining single-stranded plasinlds, conversion lo 
double-strands, and electrbporation Into bacteria. Some 5 
lijjdouble-slranded pkistnid DNA from the slartiuti libiary 
was lioubly digested with Notl and EcoRi; Uie excised 
c.DNA Inserts were s^arated from the cloning;; vector by 
agarose gel electrophoresis; and the DNA was purified us- 
ing bcta-agarase (NEB) according to the manotachirer's in- 
structions. Then 0.6 ng gel-purified double-stranded cDNA 
Inserts in 47.5 p.1 TE was digested with F.^onucleasc ill at 
37'C for 30 rain in a OO-iil reaction contolning 6 nl 
lOx Kxcnuclease HI buffer [0.5 M Tris (pH 8.0) and SO niM 
MgClj)!, 0.6 0.1 M UTT, 2.9 jll water, and 3 (J of 65 
unils/|il Exonudease III (Life frchnologies). The Exo- 
nuclease was tticn digested with 135 ng Proteinase K (Boe- 
hringer Mannhctlm) at S0°CfOr 15 min In a 100-nl reaction 
containing 10 mM Tris (i>H 7,8), 5 mM EDTA, and 0.5% 
SOS. After two extractions with phenol-tliloroform- 
isoamyl alcohol (2S:24:l), the resulting noiicoinplemen- 
tar)' siriglc-stranded DNA (total amount -03 jjigj was etha- 
nol-prcdpltated and rcsuspeiided in I (J TE. A S-(il hy- 
bridization reaction was then set up as folkiws: 1 ji.t 
Exon\iclease Ill-dlgesled cDNA Inserts (an estimated 
amount of 150 ng of single-stranded DNA) and SO ng 
■ slngle-sttanded plasmld DNA (mm the starting multiple 
sclerosis plaques library (prepared in vitro) In 2.5 ir.1 delon- 
ized formamlde were mixed and heated at 80'C lor 3 inln 
under 10 |il mineral oil. Then 0.5 iiA (10 |ig) of a blocking 
oligoniicleoflde (5'-CCTCGTGCCGAATTCTTGGCCrrC- 
GAGGGCrAAATTCCCTAT.\GTGAGrCGTATrA-3'), 0.5 
.S !U NaCl, and 0.5 tU lOx 'I E were added, and tire mbc- 
iurc was Incubated at IZ'C for 41 hr (calculated Ctf. ol 23). 
The remaining single-stranded plasaiids were purified by 
HAP chromatography, converted to double-stranded plas- 
niids, anid elect roporated into DHIOB bacteria (Life Tech- 
nologies) as described above. 



Normalization Method 4 

This is a reassoclatlon4unetic5-based approach involving 
hybildlzatlon of a 20-folU excc-ss o( cDNA inserts gf.nerated 
by PGR with the Bbrary Itself in the form nf .single-stranded 
clrclKs. followed by HAP purification of the remaining 
slngle-sttaudcd plasmids, ronvrrrilrtn to " 



elcctfopoiation into DHIOB bacteria, and amplification 
under amplciJlin selection. I'CR amplification of cDNA in- 
serts was performed using the Exiiand High Fltlellty PGR 
System (Boehringer Mannheim) according to the tiiauu- 
factmer's instructions. This PGR system is composed of an 
enzyme mixture containing thermostable Taq DNA and 
Pwo DNA polymerases (Barnes 1994). An amount of 1 
(2.5-S.O ng) DNA template IdouUe-stranded plasmids (te- 
tal lung, parathyroid aderionia, senescent fibroblasts) or 
.single-stranded circles preiwrcd in vitro (fetal heart, 
t4Nb2HFLS20W-fetal liver/spleen, and all mouse, rat. and 
SChlstosnme iib.'arics ILsted In Tabic 1)) vcas mlxi^d with 2 
ixl dNTP stock (t) le fUial concen Iratlon of eadi dNTP In the 
reaction is 200 |ii^), 5 |il of a 20-|Ui solution o( T7 Primer 
(5'-rAATACG.\CTCACTATAGCC-J'), 5 til of a 20-iiM so- 
lution of '13 Primer (5'-A ITAACCCrrCACrAAAGGGA-J'), 
10 tJ lOx I Pjspand High Fidelity buffer, 0.75 >il Expand 
High Fidelity enrynie mix (2.6 units), and 76.25 nl water. 
Then SO mineral oU was added and the reaction mixture 
was subjected lo the following amplification cycle condi- 
tions in a Perkln Elmer Theiniocycler: 7 min while ramp- 
ing up from room temperaluio to 94°C; 20 cycles of 1 mm 
at 9rC 2 mm at SS-C, and 3 nun at 7yC, and 7 min al 
72"G. PCR-ampllfied fragments were purified using the 
High Pure PGR Pioduct Purification Kit (Korhringer Man- 
nheim) as liisuuctcd by the msniitacturct. The purified 
rCR product was ethanol-precipitatoil and dissolved In 5 
(il TE. Then l.S m.1 (O.S |i,g) PGR iirodticts was mixed with 
5 (U (50 ng) library DNA (single-Stranded circles prepared 
In vitro) In deionlzed formamide, O.S p:) (10 pg) S' block- 
ing oligo AV-l (.S-.CCTCGrGCX:;GAAlTCTTGGCCTC- 
<;AUGGt7.CAAATTCCCTATAGTGAGTCGTATrA-3'). O.S 
III (10 iig) 3- blocking ollgo AR (S'-ATTAACCCTCAC- 
• JAAAGGGAA TAAGCTTGCGGCCGCTj„-3'; used for all 
but the fetal Ilver/splcen library), or alternatively, (O.S )il 
(10 (i-g) 3' blocking ollgo AV-2 (S'-ATI AACCCTTAC- 
TAAAGGGAATAAGCTTGCGGCCGCTTAATTAAA- 
GATCT„-3'; used only for the fetal liver/spleen library), 
and this mbcture was Ijcaled al 8C"C for 3 min under 10 p.1 
of mineral oil. Then 1 ^l1 lOx biiffcr-A (1.2 m NaO, 0.1 M 
Tris (pH a.O), and SO mM ED I A; used for fetal Jung, fetal 
heart, parathyroid adenoma, senescent fibroblasts, arid 
19.5-days postconcoolion (dpc) mouse embryoj or, alter- 
naUvely, 1 (ilI ]0x huffcr-B [1.2 M NaCl, 0.1 M Tris (pH 8.0), 
SOmw U)TA, and lO'X.SDS; used for 14Nb2HFLS20W-fetal 
liver/.splccn, 17.5-dpc mouse embryo, 13.5- lo 14.5-dpc 
mouse embryo, rat heart, rat kidney, and 8-wcek si^histo- 
somc], and 1.5 |i.l water were added, and the hybridization 
was performed at 30°C for 24 hr (calculated C^t - S). The 
remaining single-stranded circles were piirifiei3 by HAP 
chromatograpliy, converted to double-st-i«nd.s, and elec- 
ttopoiated Into DHIOB (Life Technologies) bacteria, as de- 



Siibtractlve Hybridization 

Double-stranded plasmld DNA fiorn a pool of 4992 tloni-s 
grown individually In 384 well plates (IMAGE Consortium 
plates LL-\M 78-yO, identification uos. 66096-67079 and 
10816& 112775) derived from the noimaliwd fetal liver/ 
spleen library (INFLS) was prepared using the Otagcn 
Mldi-i>rep kit according lo llie manufactiirer's Instruc- 
Uons, and converted lo single-strand«i circles in vitro, as 
described above. Single-stranded circles were purified by 
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HAl' chromatography ond used as a template foi PCH am- 
pllflcatloii with T7 and T3 primers, as Uocrlbf d >bovr.. An 
amounl of 1.5 ng of PCR-arapllfied cDNA Inserts from the 
IXAM 78-90 pool (ItJ 4 iil deloalzed foimamidc) was 
mixed with SO iig of stii(>le-strandcd circles froin the 
INFLS Ubraiy (in 2 jil iJelonized formamlde), 2.1 m.1 (12 jig) 
S' blocking olifso AV-1, and 2.1 (42i4j) 3' blociOng ollgo 
AV.2. Then 10 ^lI mineral oil was added, and tha mixture 
was heated it SU°C for 3 miii. Tlieii 1.2 m.1 lOx buffer-B 
and 0.6 111 water were Added, »nd tlie hybridization was 
licrforircd nl for 4tt hr (calculated Qt = 27). The k- 
racining sing!e-siranded cirrlfis were purified over HAP, 
converted to double-strands, electroporatsd into UHIOB 
Iracusrla, and projMgatcd under anipirillin selection to 
geneiate the subtracted llvcr/splccii library (INFtii-Sl). 
MAl'-bound DMA was also processed and purified for use in 
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